Method for amplification of nucleic
acids of low complexity

The present invention relates to a method for the
amplification of nucleic acids.

Description 

This invention relates to the fields of genetic engineering,
molecular biology and computer science, and more
specifically to the field of nucleic acid analysis based
on specific nucleic acid amplification.

The subject matter of the present invention is a method for
amplifying nucleic acids, such as DNA, by means of an
enzymatic amplification step, such as a polymerase chain
reaction, adapted to template nucleic acids of low
complexity, e.g. pre-treated DNA, such as, but not limited
to, DNA pre-treated with bisulfite. The invention is based on
the use of specific oligonucleotide primer molecules to
amplify only specific pieces of DNA. It is disclosed
how to optimize the primer design for a PCR if the
template DNA is of unusually low complexity. The optimal
primer design also takes into account that the treated
template DNA is single stranded.

The amplification of nucleic acids relies mainly on a
method called polymerase chain reaction (PCR). The PCR is
based on the activity of the enzyme DNA polymerase, which
elongates primer molecules bound to the template DNA by
adding dNTPs, thereby copying the template sequence
(Saiki RK, Gelfand DH, Stoeffel S, Scharf SJ,
Higuchi R, Horn T, Mullis KB and Erlich HA (1988)
Primer-directed enzymatic amplification of DNA with a
thermostable DNA polymerase. Science 239: 487-491). The
primer molecules are designed to hybridize specifically
to those regions of the template DNA that define both
ends of the amplificate. The forward primer binds to the
5' end of the sense strand of the amplificate, whereas
the reverse primer binds to the 5' end of the reverse
strand, thereby defining the starting points of the
polymerase reaction and eventually determining the length of
the amplificate.

Before the polymerase starts, the template DNA is
denatured. This is usually done in a short cycle: the
reaction mixture is heated to about 95°C, then cooled
to the annealing temperature, which is determined by the
melting temperature of the primer molecules used, and
finally the polymerase is allowed to elongate the annealed
primers at its ideal working temperature for some minutes.
This cycle is repeated several times, each time starting
with the denaturation step. The primer molecules hybridize
to the single stranded DNA. The forward primer is the
starting molecule for a copy of the sense strand and the
reverse primer is the starting molecule for a copy of the
anti-sense strand.

These first copies will be of unspecific length, limited
only by the polymerase's activity. However, in the
following cycle the forward primer will also bind to the first
copy of the anti-sense strand; the polymerase will take
that copy as a template and will elongate the primer only
as far as there is template DNA. The length of the
second copy is thereby limited to the length defined by the
first nucleotide of the second primer. In the following
cycles more and more pieces of template DNA compete for
the primer molecules and eventually the DNA amplificate
of defined length will be the main product.
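The way the defined-length amplificate comes to dominate can be illustrated with a simple count. The following is a minimal sketch under idealised assumptions that are not part of the cited method: one double-stranded template molecule at the start, and every strand primed and copied completely in every cycle. The function name is hypothetical.

```python
def pcr_strand_counts(cycles):
    """Count strands after a number of idealised PCR cycles.

    Start: one double-stranded template (two original strands).
    Returns (original, long_products, defined_products).
    """
    original = 2        # template strands; no new ones are ever made
    long_products = 0   # copies of original strands, open-ended at the 3' side
    defined = 0         # strands bounded by both primers
    for _ in range(cycles):
        # Each original strand yields one copy of unspecific length.
        new_long = original
        # A copy of a long or defined strand stops where the template
        # ends, i.e. at the first nucleotide of the other primer.
        new_defined = long_products + defined
        long_products += new_long
        defined += new_defined
    return original, long_products, defined


for n in (1, 2, 3, 10, 30):
    print(n, pcr_strand_counts(n))
```

Under these assumptions the long copies grow only linearly with the cycle number, while the defined-length strands grow exponentially, which is why they become the main product.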



However, in the case of bisulfite treated DNA the
template DNA is single stranded. The bisulfite or similar
treatment alters the original sequences on both strands
such that they are no longer complementary to each other
after the treatment. As a result, no complementary strand to
the target sequence exists. A first primer molecule binds to
one end of the single stranded target sequence. The
polymerase elongates said primer and copies said target
sequence. The second primer molecule cannot bind to the
complementary, so-called anti-sense strand, as it would
in a standard PCR. Therefore the second primer molecule
is designed to bind to the first copied sequence instead.
More specifically, it will bind to that part of the copied
nucleic acid which is the complement of the other end of
said target sequence.
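The loss of complementarity and the resulting primer placement can be made concrete with a short sketch. The sequences are hypothetical, full conversion of an unmethylated strand is assumed, and uracil is written as T because it pairs with adenine; none of the function names come from the source.

```python
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (5'->3')."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def bisulfite_convert(seq):
    """Model full conversion of an unmethylated strand: every C reads as T."""
    return seq.replace("C", "T")


def is_complementary(strand_a, strand_b):
    """True if the two strands pair perfectly over their whole length."""
    return strand_a == reverse_complement(strand_b)


sense = "ACGTCCGA"                      # hypothetical, unmethylated
antisense = reverse_complement(sense)   # "TCGGACGT"

sense_bs = bisulfite_convert(sense)           # "ATGTTTGA"
antisense_bs = bisulfite_convert(antisense)   # "TTGGATGT"

print(is_complementary(sense, antisense))        # strands pair before treatment
print(is_complementary(sense_bs, antisense_bs))  # they no longer pair afterwards

# First primer: binds one end (the 3' end) of the converted target strand.
first_primer = reverse_complement(sense_bs[-4:])
# The first copy made by the polymerase is the complement of the target.
first_copy = reverse_complement(sense_bs)
# Second primer: binds the 3' end of that copy, so its sequence is
# identical to the 5' end of the converted target strand.
second_primer = sense_bs[:4]
```

Primer lengths of four bases are used only to keep the example readable.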

The results of a PCR depend strongly on the choice
of suitable primers. The choice of a primer molecule must
respect constraints that permit a correct amplification by
PCR: it must fulfil the hybridization temperature
conditions and prevent auto- or hetero-hybridization.

In other words, as any PCR requires two primer molecules
to amplify a specific piece of DNA in one reaction, the
melting temperatures of both primers need to be very
similar in order to allow proper binding of both at the
same hybridization temperature. That is why most primer
design programs require the user to define a preferred
melting temperature or a permitted range of melting
temperatures. This requirement becomes the limiting factor
when designing primers for a so-called multiplex PCR, as
all primer pairs in use need to have the same or at least
very similar melting temperatures. Additionally, primers
have to be very specific in order to amplify only those
pieces of DNA that are the target.
By providing the means for designing extremely accurate
primer pairs for DNA hybridization procedures, this
invention relates to so-called PCR primer design. More
specifically, the body of this invention relates to the
specific requirements of primers, and therefore of primer
design, when using template DNA that consists of
essentially only three different nucleotides and is single
stranded. This is the case when using bisulfite treated
DNA as a template, as it contains no cytosine other than
the methylated cytosines in CG dinucleotides and a remainder
of insufficiently treated and therefore untransformed
non-methylated cytosines. The invention relates
specifically to the primer design when using bisulfite treated
DNA as template.

It would be obvious to an individual skilled in the art
that the use of the primers as specified in this
invention is not limited to nucleic acid amplification. Said
primers can be used for several purposes, such as
amplification, but also for nucleic acid sequencing or as
blocking oligonucleotides during analysis of bisulfite
treated DNA. Therefore the use of said primers is not
limited to nucleic acid amplification but extends to all
standard molecular biological methods.

Pairs of these primers are used to specifically amplify
DNA from a small amount of sample DNA that consists of
bisulfite treated DNA originating from a limited source
of DNA, such as a bodily fluid or tissue sample.

DNA can occur methylated or non-methylated at certain
positions, and this information is relevant for the
transcriptional status of a gene. The methyl group is attached
to the cytosine bases in CpG positions. The identification
of 5-methylcytosine in a DNA sequence, as opposed to
unmethylated cytosine, is of greatest importance, for example
when studying the role of DNA methylation in
tumorigenesis. However, because 5-methylcytosine behaves just
like cytosine with respect to its hybridization preference
(a property relied upon for sequence analysis), its
positions cannot be identified by a normal sequencing
reaction. Furthermore, in a PCR amplification this relevant
epigenetic information, methylated cytosine or
unmethylated cytosine, will be lost completely.

This problem is usually solved by treating the genomic
DNA with a chemical that converts the cytosine bases,
which consequently allows the bases to be differentiated
afterwards.

A most useful tool for analyzing DNA methylation is the
bisulfite conversion of DNA, which converts cytosine bases
into bases that show the hybridization behavior of thymine
bases. The complexity of the DNA is thereby reduced by a
fourth, as essentially only three of the four bases remain.

Bisulfite conversion is the most frequently used method
for analyzing DNA for 5-methylcytosine. It is based upon
the specific reaction of bisulfite with cytosine which,
upon subsequent alkaline hydrolysis, is converted to
uracil, whereas 5-methylcytosine remains unmodified under
these conditions (Shapiro et al. (1970) Nature 227:
1047). In its base pairing behavior, uracil corresponds
to thymine, that is, it hybridizes to adenine,
whereas 5-methylcytosine does not change its chemical
properties under this treatment and therefore still has
the base pairing behavior of a cytosine, that is, it
hybridizes with guanine. Consequently, the original DNA is
converted in such a manner that methylcytosine, which
originally could not be distinguished from cytosine by
its hybridization behavior, can now be detected as the
only remaining cytosine using "normal" molecular
biological techniques, for example by amplification and
hybridization or sequencing. All of these techniques are
based on base pairing, which can now be fully exploited.
Comparing the sequences of the DNA prior to and after
bisulfite treatment allows an easy identification of those
bases that have been methylated.
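The comparison of the sequence before and after treatment can be sketched as follows. This is an illustration only: the sequence and the methylated position are hypothetical, conversion is assumed to be complete, and the converted base is written as T, as it reads after amplification.

```python
def bisulfite_convert(seq, methylated_positions=()):
    """Convert unmethylated C to T; leave methylated C (by index) unchanged."""
    methylated = set(methylated_positions)
    return "".join(
        "T" if base == "C" and index not in methylated else base
        for index, base in enumerate(seq)
    )


def methylated_cytosines(original, converted):
    """Positions that are C both before and after the treatment."""
    return [
        index
        for index, (before, after) in enumerate(zip(original, converted))
        if before == "C" and after == "C"
    ]


original = "ACGTCGACCT"   # hypothetical sequence
converted = bisulfite_convert(original, methylated_positions=[4])
print(converted)
print(methylated_cytosines(original, converted))
```

Only the cytosine that survives the treatment is reported, which mirrors the statement that methylcytosine is detected as the only remaining cytosine.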

In the scope of this invention, the phrase "a nucleotide
(...) was converted by the treatment..." refers to a
conversion that makes it possible to differentiate between
methylated and un-methylated cytosine bases within said
sample, for example the conversion of un-methylated cytosine
bases to bases which hybridize to adenine by treatment with
bisulfite.

An alternative method is to use restriction enzymes that
are capable of differentiating between methylated and
un-methylated DNA, but its use is limited by the
selectivity of the restriction enzyme towards a specific
sequence.

An overview of the further known methods of detecting
5-methylcytosine may be gathered from the following review
article: Rein T, DePamphilis ML, Zorbas H, Nucleic Acids
Res. 1998, 26, 2255.

In terms of sensitivity, the prior art is defined by a
method which encloses the DNA to be analyzed in an
agarose matrix, thus preventing the diffusion and
renaturation of the DNA (bisulfite reacts with single-stranded
DNA only), and which replaces all precipitation and
purification steps with fast dialysis (Olek A, Oswald J,
Walter J (1996) A modified and improved method for bisulfite
based cytosine methylation analysis. Nucleic Acids Res.
24: 5064-6). Using this method, it is possible to analyze
individual cells, which illustrates the potential of the
method.
To date, barring few exceptions (e.g., Zeschnigk M, Lich
C, Buiting K, Doerfler W, Horsthemke B (1997) A
single-tube PCR test for the diagnosis of Angelman and
Prader-Willi syndrome based on allelic methylation differences
at the SNRPN locus. Eur J Hum Genet. 5: 94-8), the
bisulfite technique is only used in research. In all cases,
however, short, specific fragments of a known gene are
amplified subsequent to a bisulfite treatment and are either
completely sequenced (Olek A, Walter J (1997) The
pre-implantation ontogeny of the H19 methylation imprint.
Nat Genet. 3: 275-6), or individual cytosine positions are
detected by a primer extension reaction (Gonzalgo ML and
Jones PA (1997) Rapid quantitation of methylation
differences at specific sites using methylation-sensitive
single nucleotide primer extension (Ms-SNuPE). Nucleic Acids
Res. 25: 2529-31; WO 95/00669) or by enzymatic digestion
(Xiong Z, Laird PW (1997) COBRA: a sensitive and quantitative
DNA methylation assay. Nucleic Acids Res. 25: 2532-4).

Another technique to detect hypermethylation is the
so-called methylation specific PCR (MSP) (Herman JG, Graff
JR, Myohanen S, Nelkin BD and Baylin SB (1996)
Methylation-specific PCR: a novel PCR assay for methylation
status of CpG islands. Proc Natl Acad Sci USA. 93:
9821-6). The technique is based on the use of primers that
differentiate between a methylated and a non-methylated
sequence if applied after bisulfite treatment of said DNA
sequence. The primer either contains a guanine at the
position corresponding to the cytosine, in which case it
will bind after bisulfite treatment only if the position
was methylated, or it contains an adenine at the
corresponding cytosine position and therefore binds
to said DNA sequence after bisulfite treatment only if the
cytosine was unmethylated and has hence been altered by the
bisulfite treatment so that it hybridizes to adenine.
With the use of these primers, amplicons can be produced
specifically depending on the methylation status of a
certain cytosine and will as such indicate its
methylation state. The present invention, however, preferably
does not include CpGs in the primer sequence.

Another new technique is the detection of methylation via
TaqMan PCR, also known as MethyLight (WO 00/70090). With
this technique it became feasible to determine the
methylation state of single or of several positions directly
during PCR, without having to analyze the PCR products in
an additional step.

In addition, detection by hybridization has also been
described (WO 99/28498).

Further publications dealing with the use of the
bisulfite technique for methylation detection in individual
genes are:
Grigg G, Clark S (1994) Sequencing 5-methylcytosine
residues in genomic DNA. Bioessays 16: 431-6; Zeschnigk M,
Schmitz B, Dittrich B, Buiting K, Horsthemke B, Doerfler
W (1997) Imprinted segments in the human genome:
different DNA methylation patterns in the Prader-Willi/Angelman
syndrome region as determined by the genomic sequencing
method. Hum Mol Genet. 6: 387-95; Feil R, Charlton J,
Bird AP, Walter J, Reik W (1994) Methylation analysis on
individual chromosomes: improved protocol for bisulphite
genomic sequencing. Nucleic Acids Res. 22: 695-6; Martin
V, Ribieras S, Song-Wang X, Rio MC, Dante R (1995)
Genomic sequencing indicates a correlation between DNA
hypomethylation in the 5' region of the pS2 gene and its
expression in human breast cancer cell lines. Gene 157:
261-4; WO 97/46705; WO 95/15373; WO 97/45560.

For all the methods mentioned above that are based on
PCR amplification of bisulfite treated DNA, the biggest
challenge is to design primers that are specific.

THE PROBLEM AND ITS SOLUTION

A number of programs are available on the market
that offer to design primer pairs for amplifying a
piece of DNA in a PCR. As input they usually require the
template DNA sequence, the preferred melting temperature
TM, the desired length of the amplificate and, optionally,
the preferred length of the primer molecules.

However, if a primer is required to bind specifically to
bisulfite treated DNA, the design of the primer molecule
is especially difficult, and the tools known in the art
are not capable of designing primers that lead to specific
products. The following problems occur when dealing with
bisulfite treated DNA instead of standard DNA:

First, the sequence complexity of the bisulfite treated
genome is reduced dramatically. Complexity in this
context is meant to be a measure of the similarity of a
given sequence to a random or stochastic sequence; the
more complex a sequence is, the more similar it is to a
random sequence. A reduced complexity of the genome means
there are fewer degrees of variation. Where there are
essentially only three different nucleotides rather than
four, the probability that a sequence occurs twice in a
given length of sequence is much higher. For example, a
primer molecule of 20 nucleotides in length is likely to
be unique in the human genome, if it is not part of a
repeat sequence: the human genome is known to consist of
about 3 x 10^9 bases. There are 4^20, or approximately
10^12, different ways to form sequences of a length of 20
nucleotides, assuming equidistribution of the bases, which
makes multiple occurrences of a given 20-mer
(oligonucleotide of 20 nucleotides) extremely unlikely.
However, since only 3^20, or approximately 3 x 10^9,
different 20-mers are possible over a 3-letter alphabet,
such multiple occurrence cannot be excluded. In addition, a
bisulfite treated sequence, enriched in thymine in the sense
strand and enriched in adenine in the reverse complementary
strand, will contain more repeats and regions of general low
complexity.
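The arithmetic behind this argument can be checked directly. The sketch below assumes, as the text does, independent and equally likely bases and a genome of about 3 x 10^9 positions on one strand; the function name is hypothetical.

```python
GENOME_SIZE = 3e9  # approximate number of bases in the human genome


def expected_occurrences(k, alphabet_size, genome_size=GENOME_SIZE):
    """Expected count of one given k-mer under equidistributed bases."""
    positions = genome_size - k + 1
    return positions / alphabet_size ** k


print(4 ** 20)   # number of 20-mers over four letters
print(3 ** 20)   # number of 20-mers over three letters

four_letter = expected_occurrences(20, 4)
three_letter = expected_occurrences(20, 3)
print(four_letter, three_letter)
```

With four letters a given 20-mer is expected far less than once per genome; with three letters the expectation is close to one, so a second occurrence by chance is plausible.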

Another way to enhance or guarantee the uniqueness of primer
and/or oligo molecules is to estimate their expected
frequency in the genome based upon a Markov model of order n
for the human genome, or to check their uniqueness
explicitly by counting their exact occurrences. The estimation
based upon the Markov model relies upon the determination
of the probabilities of all 4^n n-mers (oligo molecules of
n nucleotides) in the human genome, or in all amplificates
which are used in the hybridization, and of the conditional
probabilities of all four bases given these n-mers. The
primer pairs are constructed from forward and reverse
oligos which lie within an appropriate distance of each
other and which have minimal individual expected
occurrence elsewhere in the genome.
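A Markov-model estimate of this kind can be sketched as follows. This is a minimal illustration, not the implementation used in the invention: the probabilities are estimated by simple counting on a reference string, no smoothing is applied, and all names are hypothetical.

```python
from collections import Counter


def train_markov(reference, order):
    """Count all n-mers and (n+1)-mers (n = order) in the reference."""
    nmers = Counter(
        reference[i:i + order] for i in range(len(reference) - order + 1)
    )
    transitions = Counter(
        reference[i:i + order + 1] for i in range(len(reference) - order)
    )
    return nmers, transitions


def expected_frequency(oligo, reference, order):
    """Expected number of occurrences of oligo in a sequence of the same
    length as the reference, under an order-n Markov model."""
    nmers, transitions = train_markov(reference, order)
    # Probability of the first n-mer of the oligo.
    probability = nmers[oligo[:order]] / sum(nmers.values())
    # Multiply by the conditional probability of each following base.
    for i in range(order, len(oligo)):
        context = oligo[i - order:i]
        context_total = sum(transitions[context + base] for base in "ACGT")
        if context_total == 0:
            return 0.0
        probability *= transitions[context + oligo[i]] / context_total
    return probability * (len(reference) - len(oligo) + 1)


reference = "ACGT" * 50   # toy reference; a genome would be used in practice
print(expected_frequency("ACGTAC", reference, order=2))
print(expected_frequency("AAAA", reference, order=2))
```

Candidate oligos with the lowest expected frequency would then be preferred when forward and reverse primers are paired.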

A second challenge in primer design for bisulfite treated
DNA is that the melting temperature TM of a bisulfite DNA
primer of a certain length is typically lower than the
melting temperature TM of a standard primer containing
cytosines. This is due to the fact that every cytosine in
a bisulfite treated DNA is, after amplification by PCR,
replaced by thymine. Cytosine binds its corresponding
base guanine via three hydrogen bonds, whereas thymine
binds its corresponding base adenine via only two hydrogen
bonds, leading to generally weaker binding and thus a
lower TM.
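The size of this effect can be estimated with the Wallace rule of thumb, TM = 2 °C x (A+T) + 4 °C x (G+C). The rule is only a rough approximation for short oligonucleotides and is used here as an assumption for illustration; the source does not prescribe a particular TM formula. The primer sequence is hypothetical.

```python
def wallace_tm(primer):
    """Rough melting temperature in °C: 2 per A/T plus 4 per G/C."""
    at_count = primer.count("A") + primer.count("T")
    gc_count = primer.count("G") + primer.count("C")
    return 2 * at_count + 4 * gc_count


standard_primer = "AGCTCCGATCCAGTCGACTA"              # hypothetical 20-mer
bisulfite_primer = standard_primer.replace("C", "T")  # every C read as T

print(wallace_tm(standard_primer))
print(wallace_tm(bisulfite_primer))
```

Each cytosine replaced by thymine lowers the estimate by 2 °C, so a primer for bisulfite treated DNA usually has to be longer to reach the same TM.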
A third problem arises from the fact that bisulfite
treated sequences not only lack cytosines but are
also thymine-rich. Thymine also hybridizes unspecifically
with guanine. This makes mismatching (unspecific binding
of a primer to a sequence that is not identical to its
target) of a primer designed for bisulfite treated DNA much
more likely than mismatching of a standard primer consisting
of four different nucleotides.

It is the aim of this invention to overcome these
problems, which are specific to primer based amplification
of bisulfite treated DNA.

For a so-called "multiplex PCR" it becomes especially
difficult to design primer pairs. This expression is used
to describe an experiment in which several different
pieces of DNA are amplified simultaneously in one
reaction vessel. This saves a lot of effort and time and
is as such a basic requirement for high throughput assays
based on PCR amplification. An overview of the state of
the art concerning multiplex PCR is given by Henegariu et
al. (Henegariu O, Heerema NA, Dlouhy SR, Vance GH and
Vogt PH (1997) Multiplex PCR: Critical Parameters and
Step-by-Step Protocol. BioTechniques 23: 504-511), who
offer a step-by-step protocol on how to tackle multiplex
PCR problems. However, the possibility of a special primer
design is not mentioned in this article.

To ensure that the multiplex PCR works and that the
multiple products are indeed amplified, a gel
electrophoresis of the reaction mixture is usually performed.
The products are separated according to their different
sizes. Unfortunately, the ability of agarose gel
electrophoresis to distinguish the products is limited.
However, it is possible to test for different product sizes
by means of a fragment analyzer, which is much more accurate
and able to distinguish product sizes differing by one
base. Hence different product sizes are no longer a
requirement to be considered in the primer design for a
multiplex PCR.

In patent WO 01/94634 a method for a multiplex PCR using
at least two primer pairs is described that consists
basically of a two-step amplification procedure, wherein one
step is referred to as pre-amplification. After
pre-amplification (by means of PCR) with a number of primer
pairs, the sample is divided into as many portions as
there are primer pairs. At least one (and preferably only
one) of the previously used primer pairs is added. This
method does not relate in any way to the selection or
design of primer molecules described herein.

In an article regarding multiplex PCR by Shuber et al.
(Shuber AP, Grondin VJ and Klinger KW (1995) A simplified
procedure for developing multiplex PCRs. Genome Res 5(5):
488-493), the authors suggest using primers which
contain a 3' region complementary to sequence specific
recognition sites and a 5' region of a defined length of
20 nucleotides each. The authors claim that they could
establish identical reaction conditions, cycling times
and annealing temperatures for any PCR primer pair
following those requirements.

In several recent papers successful multiplex PCRs have
been established. For example, Becker et al. have
reported the development of a multiplex PCR reaction for
the detection of multiple staphylococcal enterotoxin
genes, which uses individual primer sets for each toxin
gene (Becker K, Roth R and Peters G (1998) Rapid and
specific detection of toxigenic Staphylococcus aureus: use
of two multiplex PCR enzyme immunoassays for
amplification and hybridization of staphylococcal enterotoxin
genes, exfoliative toxin genes, and toxic shock syndrome
toxin 1 gene. J. Clin. Microbiol. 36: 2548-2553). This
has been developed even further by Monday and Bohach, who
increased the number of primer pairs applied in one
reaction to about 10 in order to have one assay that
amplifies all of the characterized enterotoxin genes. This
still required a unique established primer pair for the
detection of every individual gene (Monday SR and Bohach
GA (1999) Use of multiplex PCR to detect classical and
newly described pyrogenic toxin genes in staphylococcal
isolates. J. Clin. Microbiol. 37: 3411-3414).

In another paper, by Sharma et al., a method for a
one-vessel multiplex PCR is described wherein each of six
chosen primer pairs consists of one identical universal
forward primer, based on a highly conserved region of
the genes of interest, and one reverse primer specific
for each individual gene. As such, the assay leads to a
rapid amplification of a family of genes which all have
a conserved region in common. It is designed to detect
the presence or absence of certain genes in an unknown
mixture. No further information is given about the primer
design, apart from the statement that the primers were
designed by alignment of published DNA sequences. This is
certainly not the only requirement, though, as one big
limitation of the method is the need to obtain PCR products
of different sizes in order to identify them in the end
(Sharma NK, Rees CED and Dodd CER (2000) Development of a
single-reaction multiplex PCR toxin typing assay for
Staphylococcus aureus strains. Applied and Environmental
Microbiology 66(4): 1347-1353).

The patent application WO 01/36669 describes a method
which uses a similar approach for the controllable
amplification of a higher number of sequences: one
randomly chosen reverse primer that hybridizes
unspecifically and a number of specific forward primers are
selected to amplify a group of sequences. As the
reverse primer is labeled, all products formed will be
labeled as well. By hybridizing said amplicons to
immobilized detection oligos, which are able to
differentiate the products, it is easy to see which
products have been amplified, and hereby the presence or
absence of said sequences in the mixture can be
determined.

The big disadvantage of all these methods is that every
primer pair needs to be established individually first, to
ensure that a PCR product of the expected size is
produced and that no additional or nonspecific products are
generated. Once the specificity of the primer pairs has
been determined, PCR conditions, buffers and primer
concentrations need to be optimized to establish conditions
under which the primer molecules can be combined into one
single PCR reaction without affecting the ability of the
primer pairs to generate a gene specific amplicon.

A more recently published approach by Nicodeme and
Steyaert describes the conditions required for multiplex
PCR and suggests an algorithm for the automatic selection
of primer pairs (Nicodeme P and Steyaert JM (1997) Selecting
optimal oligonucleotide primers for multiplex PCR. Proc.
Int. Conf. Intell Syst Mol Biol; 5: 210-213). In this
approach the conditions for pre-selecting primer pairs
for a successful one-locus amplification (singleplex PCR
conditions) are rather broad. The three basic
requirements are the pairing distance between a forward and a
reverse primer, the condition of non-palindromicity of a
primer, and the condition that the 3' end of a primer
must not be reverse complementary to the sequence of any
of the other primers. This selection is done with the help
of a typical primer design program called PRIMER. However,
PRIMER is a two-step program, and in this approach the
new method to design primers for a multiplex PCR takes
as input the output of step 1, which is a list of
possible forward primers and a list of possible reverse
primers for every amplificate.

The only further selection criterion for the multiplex PCR
primers is the absence of reverse complementarity of
their 3' ends towards the other primer sequences in the
experiment. A second critical factor considered here is
the GC versus AT ratio. To some extent it is this ratio
that determines the melting temperature of a primer pair.
The authors suggest limiting the GC/AT ratio to
a given range, which would enable the simultaneous
hybridization of several primer pairs at one reaction
temperature. The final requirement is the electrophoresis
distance, determined by the tool that is used to
differentiate the PCR products, for example a gel
electrophoresis. This most common method requires the products
to be of different sizes. The whole concept of this
method also requires a pool of possible primer
pairs for each amplicon.
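The 3' end criterion can be sketched as a simple check over all primers in one experiment. This is an illustration under stated assumptions, not the algorithm of Nicodeme and Steyaert: the tail length of four bases is an arbitrary choice, only perfect complementarity is tested, and the primer sequences are hypothetical.

```python
from itertools import product

COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (5'->3')."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def three_prime_clash(primer_a, primer_b, tail=4):
    """True if the last `tail` bases of primer_a can pair perfectly with
    some stretch of primer_b."""
    return reverse_complement(primer_a[-tail:]) in primer_b


def clashing_pairs(primers, tail=4):
    """All ordered primer pairs (including a primer with itself) in which
    the 3' end of the first is reverse complementary to the second."""
    return [
        (a, b)
        for a, b in product(primers, repeat=2)
        if three_prime_clash(a, b, tail)
    ]


primer_1 = "GGATTTGGTTAGAG"   # hypothetical primers
primer_2 = "TTAACCCTCTAACC"
print(clashing_pairs([primer_1, primer_2]))
```

A primer set would be accepted for the multiplex reaction only if this list is empty.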

The design of suitable primers for a multiplex PCR on
bisulfite treated DNA is an even greater challenge. The
low complexity of the DNA, which is reduced to essentially
three different bases rather than four, requires an extra
careful selection of primers to avoid mismatching and
unwanted amplification.

In the scope of this invention the word "mismatching"
corresponds to the situation in which the alignment of two
sequences which are essentially complementary reveals
positions in one of the sequences where the nucleotide base
does not align with its corresponding base but with a
different one. The corresponding or complementary base pairs
are adenine and thymine, cytosine and guanine, uracil and
adenine. For example, a cytosine that aligns with a
thymine in its otherwise complementary sequence creates a
mismatch of one base or nucleotide.

Accordingly, "base mismatches" refers to the situation of
a base mismatching with another as explained above, and
"one or more base mismatches" refers to one or
more bases (in a given sequence) that cannot be aligned
with their corresponding bases.

Also, when the alignment reveals single nucleotide gaps
in one of the aligned sequences, this is covered by
the term "mismatch" in the scope of this invention.

A 'gap' is to be understood as follows: if an alignment
reveals that, in order to get the highest number of
corresponding base pairs aligned, some bases lack a
corresponding base in the otherwise complementary
sequence, this is called a gap. Such a gap can have a
length of one or more nucleotides.
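The definition of a base mismatch can be expressed as a short count over an ungapped alignment. The sketch below is a simplification: it assumes both sequences have the same length and does not model gaps, which would require a full alignment procedure. The function name and example sequences are hypothetical.

```python
# Complementary base pairs as listed in the text.
PAIRS = {("A", "T"), ("T", "A"), ("C", "G"), ("G", "C"), ("A", "U"), ("U", "A")}


def count_mismatches(primer, site):
    """Count positions where primer and site do not form a base pair.

    Both sequences are written 5'->3'. Hybridization is antiparallel,
    so the site is reversed before the position-wise comparison.
    """
    if len(primer) != len(site):
        raise ValueError("ungapped comparison needs equal lengths")
    return sum(
        1
        for primer_base, site_base in zip(primer, reversed(site))
        if (primer_base, site_base) not in PAIRS
    )


print(count_mismatches("ACCT", "AGGT"))  # perfectly complementary
print(count_mismatches("ACCT", "AGTT"))  # one cytosine faces a thymine
```

The second call reproduces the example from the text, in which a cytosine aligned with a thymine creates a mismatch of one base.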

To solve the problems mentioned above we invented a
method consisting of several steps that is applicable to
the amplification of nucleic acids in singleplex as well
as in multiplex PCR experiments.

SUMMARY OF THE INVENTION 

The method comprises the following steps:
Firstly, the nucleic acid sample containing the region of
interest, which is to be amplified, is isolated.
Secondly, this nucleic acid sample is treated in a manner
that differentiates between methylated and un-methylated
cytosine bases within said sample. Thirdly, a reaction
mixture is set up containing a) the treated template
nucleic acids, carrying the region of interest (also
called: target nucleic acid) that is to be amplified, b)
specified oligonucleotide primers, c) an enzyme capable
of amplifying said nucleic acids in a defined manner, d)
the nucleotides required for the nucleic acid
synthesis and e) a suitable buffer.

Said specified oligonucleotide primers are characterized
in that
- their sequences each reach a predefined measure of
  complexity (as described in detail below),
- every possible combination of two primer molecules in
  said reaction mixture has a melting temperature below a
  specified threshold temperature, and
- none of the possible combinations of two primer molecules
  in said reaction mixture leads to the amplification of an
  additional unwanted product, as determined by virtual
  testing for amplification.

In the last step of the method said amplified target
nucleic acid is detected by means commonly used by one
skilled in the art.

The invention is a method for the amplification of
nucleic acids comprising the steps of
isolating a nucleic acid sample, treating said sample in
a manner that differentiates between methylated and
un-methylated cytosine bases within said sample, amplifying
at least one target sequence within said treated nucleic
acid by means of enzymatic amplification and a set of
primer molecules, wherein said primer molecules are
characterized in that
a) each primer molecule sequence reaches a predefined
measure of complexity, b) every combination of any two
primer molecules in the set has a melting temperature
below a specified threshold temperature and c) every
combination of two primer molecules, under conditions allowing
for one or more base mismatches per primer, does not lead
to the amplification of an unwanted product when
virtually tested using the treated and the untreated sample
nucleic acids as template, and, as the last step, detecting
said amplified target nucleic acid.
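Criterion c), the virtual test for unwanted products, can be sketched as a search for primer binding sites that tolerates a number of mismatches. The following is a simplified illustration and not the procedure of the invention: only one strand of the template is searched, gaps are not modelled, the product length limit is an arbitrary assumption, and all names and sequences are hypothetical. In practice the function would be run once on the treated and once on the untreated sequence.

```python
from itertools import product

COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (5'->3')."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def match_positions(pattern, text, max_mismatches):
    """Start positions where pattern fits text with few enough mismatches."""
    hits = []
    for start in range(len(text) - len(pattern) + 1):
        window = text[start:start + len(pattern)]
        mismatches = sum(1 for p, t in zip(pattern, window) if p != t)
        if mismatches <= max_mismatches:
            hits.append(start)
    return hits


def virtual_pcr(template, primers, max_mismatches=1, max_product=500):
    """Predict products for every ordered combination of two primers.

    Returns tuples (forward, reverse, start, product_length).
    """
    products = []
    for forward, reverse in product(primers, repeat=2):
        reverse_site = reverse_complement(reverse)
        for start in match_positions(forward, template, max_mismatches):
            for site in match_positions(reverse_site, template, max_mismatches):
                length = site + len(reverse) - start
                if site >= start + len(forward) and length <= max_product:
                    products.append((forward, reverse, start, length))
    return products


# Hypothetical converted template: flank + forward site + insert + reverse site + flank.
template = "AA" + "GATTGGAT" + "AGTTTAGGATGT" + "TGAGTTAG" + "AA"
forward = "GATTGGAT"
reverse = reverse_complement("TGAGTTAG")
print(virtual_pcr(template, [forward, reverse], max_mismatches=0))
```

Any predicted product other than the intended amplificate would count as unwanted, and the primer combination producing it would be rejected.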

More detailed description of the method: 

The method comprises the following steps:
In the first step of the method, the nucleic acid sample,
which contains the region of interest that is to be
amplified, must be isolated from tissue or cellular
sources. Such sources may include at least one cell, but
usually several cells, cell lines, histological slides,
bodily fluids, or tissue embedded in paraffin.

In a preferred embodiment of this invention the nucleic
acid sample is isolated from a bodily fluid, a cell
culture, a tissue sample or a combination thereof.

For example, an organ sample from a patient or an animal
can be used to extract genomic DNA by the usually applied
methods. Preferably, in this invention DNA is extracted
from a tissue sample or a biological fluid like blood,
serum, urine or other fluids. 'Bodily fluid' herein refers
to a mixture of macromolecules obtained from an organism.
This includes, but is not limited to, blood, blood plasma,
blood serum, urine, sputum, ejaculate, semen, tears,
sweat, saliva, lymph fluid, bronchial lavage, pleural
effusion, peritoneal fluid, meningeal fluid, amniotic
fluid, glandular fluid, fine needle aspirates, nipple
aspirate fluid, spinal fluid, conjunctival fluid, vaginal
fluid, duodenal juice, pancreatic juice, bile and
cerebrospinal fluid. This also includes experimentally
separated fractions of all of the preceding. 'Bodily
fluid' also includes solutions or mixtures containing
homogenized solid material, such as feces.

The nucleic acids may include DNA or RNA. Isolation may
be by means that are standard to one skilled in the art;
this includes, for example, extraction of DNA with the use
of detergent lysates, sonication and vortexing with glass
beads. An example is the extraction of DNA from a piece of
a plant, like a leaf or fruit. Once the nucleic acids,
like genomic double stranded DNA, have been extracted they
are used in the analysis.

In a preferred embodiment of this invention the nucleic
acid sample is comprised of plasmid DNA, BACs (bacterial
artificial chromosomes), YACs (yeast artificial
chromosomes) or genomic DNA.

In another especially preferred embodiment of this
invention the nucleic acid sample is comprised of human
genomic DNA. It is preferred that the nucleic acids are of
human origin.

In the second step, this nucleic acid sample is treated
in a manner that differentiates between methylated and
unmethylated cytosine bases within said sample. Cytosine
bases which are unmethylated at the 5-position are
converted to uracil, thymine, or another base which is
dissimilar to cytosine in terms of hybridization behavior.
This will be understood as 'treatment' hereinafter. The
method most commonly used so far is the so-called
bisulfite treatment.

This step is essential to the process, as it translates
the methylation pattern of said nucleic acids into a
sequence pattern that serves as an imprint of the
methylation status. The imprint contains essentially the
same information but, unlike the methylation pattern
itself, it is not lost during amplification via PCR.
Amplification via PCR does not differentiate between
methylated and unmethylated cytosines and would therefore
lose this level of information. The original methylation
status can, however, be deduced whenever the described
pre-treatment has been performed prior to the
amplification step. Hence any means suitable to
differentiate between a methylated and an unmethylated
cytosine base is applicable, as long as the modified bases
can still be amplified by enzymatic means after treatment.

It is a preferred embodiment of this invention that said
sample is treated by means of a solution of a bisulfite,
hydrogen sulfite or disulfite. A treatment of genomic DNA
as described above is carried out with bisulfite (hydrogen
sulfite, disulfite) and subsequent alkaline hydrolysis,
which results in a conversion of non-methylated cytosine
nucleobases to uracil or to another base which is
dissimilar to cytosine in terms of base pairing behavior.

In the third step of this method, a reaction mixture is
set up containing a) the treated template nucleic acids,
comprising the region of interest (also called target
nucleic acid) that is to be amplified, b) specified
oligonucleotide primers, c) an enzyme capable of
amplifying said nucleic acids in a defined manner, for
example a polymerase, d) the nucleotides required for the
nucleic acid synthesis and e) a suitable buffer. The
template nucleic acid contains at least one target nucleic
acid, which is amplified in the reaction. One primer
molecule of the at least one primer pair in the reaction
mixture is capable of binding to the 3' end of one
specified target nucleic acid. This first primer binds to
the 3' end of the target sequence and is elongated, so
that a sequence complementary to the target sequence is
made. The polymerase stops elongating unspecifically.
The next cycle starts by thermally denaturing the now
double stranded template nucleic acid into single stranded
template nucleic acids. This is followed by the next
annealing phase, in which both primer molecules
specifically bind to the target nucleic acid and its
complementary strand. The second primer is identical to
the 5' end of the target molecule. It does not bind to the
target sequence itself but to said nucleic acid
complementary to the target sequence, as soon as the
latter has been denatured from the template.

The process is finished by the actual amplification phase
at a slightly lower reaction temperature, during which
the enzyme, for example the polymerase, elongates the
primer as a complementary sequence to the target nucleic
acid. The polymerase elongates the second primer by using
the first copy as template until the end of said copied
nucleic acid is reached. That way an identical copy of the
original single stranded target nucleic acid is created.
Hence, the length of the amplificate is determined by the
choice of the two primers.

The elongation products, being complementary to each
other and thereby forming a double stranded version of
the target nucleic acid, serve as additional targets for
the primer molecules binding in the next cycle of
amplification.

Essentially, step 3 of the method is comprised of
amplifying at least one target sequence, within said
treated nucleic acid, by means of enzymatic amplification
and a set of primer molecules.

Said primer molecules used in said method are
characterized in that they, in addition to fulfilling all
the usual requirements towards a PCR primer as will be
specified in more detail later, also fulfill the following
requirements:

Firstly, the sequence of each primer molecule used in
step 3 of this method reaches a predefined measure of
complexity.

In a preferred embodiment of this method the primer
molecules reach a certain value of linguistic complexity.
A notion and a measure of linguistic complexity was
introduced by Trifonov in 1990 and has been used for the
analysis of nucleotide sequences before (Trifonov, EN
(1990) Making sense of the human genome. In Structure &
Methods. Vol 1 pp 69-77 (eds. Sarma, RH and Sarma MH,
Adenine Press, Albany, US)). The linguistic complexity
technique allows a calculation to be made of the
structural complexity of any linear sequence of characters
irrespective of whether the text is cognized or presently
undeciphered. The sequences are compared exclusively from
the point of view of their structural complexity with no
reference to the meaning of the texts. In 1997 Trifonov
published how the linguistic complexity of nucleosomal
sequences is defined (Bolshoy, A; Shapiro, K; Trifonov, E
and Ioshikhes I. (1997) Enhancement of the nucleosomal
pattern in sequences of lower complexity. NAR 25 (16):
3248-3254). Quote: "The linguistic complexity measure
exploits the major distinguishing feature between natural
nucleotide sequences and uniformly random ones: the
repetitiveness of the natural sequences, i.e. the frequent
repetition, not necessarily a tandem one, of some
oligonucleotides ("words"), while others are avoided.
(...) Complexity can be directly calculated as the extent
to which the maximal possible vocabulary (all word sizes
considered) is utilized in a given stretch of sequence
(...)."
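
As an illustration of the quoted definition, the following
minimal Python sketch computes one possible formulation of
linguistic complexity: the number of distinct words of
every length observed in a sequence, divided by the maximum
number that could occur given the sequence length and a
four-letter alphabet. The function name and the choice of a
sum-based ratio (other formulations multiply the per-length
ratios instead) are assumptions made for illustration only
and are not prescribed by the text.

```python
def linguistic_complexity(seq: str, alphabet_size: int = 4) -> float:
    """Fraction of the maximal possible vocabulary used by seq.

    Hypothetical helper: sums distinct words over all word sizes
    and divides by the largest vocabulary the sequence could hold.
    """
    seq = seq.upper()
    n = len(seq)
    if n == 0:
        raise ValueError("sequence must not be empty")
    observed = 0
    possible = 0
    for k in range(1, n + 1):
        # Distinct words of length k actually present in the sequence.
        words = {seq[i:i + k] for i in range(n - k + 1)}
        observed += len(words)
        # At most 4**k different words exist, and at most n - k + 1 fit.
        possible += min(alphabet_size ** k, n - k + 1)
    return observed / possible


if __name__ == "__main__":
    for primer in ("ACGT", "AAAA", "TTTTATTTTATTTTA"):
        print(primer, round(linguistic_complexity(primer), 3))
```

A primer candidate would then be accepted only if this
value reaches the predefined threshold; the threshold
itself is left open here, as it is in the text.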

In another preferred embodiment of this method said
measure of complexity is set by the so-called Shannon
entropy (Shannon, C E, (1948) A Mathematical Theory of
Communication, University of Illinois Press, Urbana). This
is the most common measure to assess the information
content (in a technical, non-semantic meaning) of linear
information carriers. It attributes the maximal value
(which can be chosen to be 1 without restrictions) to
sequences where all symbols (characters) occur at equal
probability and a value of 0 to sequences consisting of
just one repeated symbol (character, letter). A derived
and more general measure is the higher order Shannon
entropy, which attributes the maximal value to sequences
where all subsequences occur at equal probability and a
value of 0 or close to 0 to sequences consisting of
periodic repetitions of short subsequences. The practical
determination of the (higher order) Shannon entropy,
however, is limited by the finite length of the sequences,
which often does not permit a precise estimation of the
probability distribution of their constitutive symbols.
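
The first order case described above can be sketched in a
few lines of Python. The sketch normalizes the entropy to
the range 0 to 1 by taking the logarithm to the base of the
alphabet size (4 for nucleotides); the function name and
this particular normalization are illustrative assumptions,
and the higher order variant is not covered.

```python
import math
from collections import Counter


def shannon_entropy(seq: str, alphabet_size: int = 4) -> float:
    """First order Shannon entropy of seq, scaled so the maximum is 1."""
    seq = seq.upper()
    n = len(seq)
    if n == 0:
        raise ValueError("sequence must not be empty")
    counts = Counter(seq)
    # Sum of -p * log(p) over the observed symbols, log base = alphabet size.
    return sum(-(c / n) * math.log(c / n, alphabet_size)
               for c in counts.values())


if __name__ == "__main__":
    for primer in ("ACGTACGT", "AAAAAAAA", "TTTTATTA"):
        print(primer, round(shannon_entropy(primer), 3))
```

For a short primer the symbol frequencies are only a rough
estimate of the underlying probabilities, which is the
practical limitation noted above.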

Further possible measures are, for example, the Lempel-Ziv
complexity (Lempel, LB and Ziv, J (1976) On the complexity
of finite sequences. IEEE Trans. Inf. Theory IT-22,
75-81), the grammar complexity (Ebeling, W;
Jimenez-Montano, MA (1980) On Grammars, Complexity and
Information Measures of Biological Macromolecules.
Mathematical Bioscience 52, 53-71), the algorithmic
complexity (Chaitin, 1990) and the conditional entropy.

Secondly, said primer molecules are also characterized in
that every possible combination of any two primer
molecules in the set has a melting temperature below a
specified threshold temperature. That way the accumulation
of dimers caused by the binding of two primer molecules to
each other in said reaction mixture is excluded. The
number of primer pairs used in that step can be any number
between one and n, leading to one or n amplificates
respectively (n being a natural number).

As mentioned in the text, the word "dimer" refers to a
secondary structure formed by the hybridization of two
primer molecules to each other.

As referred to in the text, 'melting temperature' refers
to the temperature at which 50% of the nucleic acid
molecules are in duplex and 50% are denatured under
standard reaction solution conditions.
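
A pairwise screen of this kind might look as follows. The
sketch is deliberately crude: it takes the longest
perfectly complementary stretch between two primers as the
dimer and estimates its melting temperature with the
Wallace rule of thumb (2 degrees C per A/T pair, 4 degrees
C per G/C pair). Both the dimer model and the Tm estimate
are assumptions chosen for brevity, not the method of the
text; a nearest-neighbor model would be the more realistic
choice.

```python
from itertools import combinations_with_replacement

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq: str) -> str:
    """Reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def longest_common_substring(a: str, b: str) -> str:
    """Longest contiguous stretch shared by a and b (dynamic programming)."""
    best = ""
    prev = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        cur = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            if a[i - 1] == b[j - 1]:
                cur[j] = prev[j - 1] + 1
                if cur[j] > len(best):
                    best = a[i - cur[j]:i]
        prev = cur
    return best


def dimer_tm(p1: str, p2: str) -> int:
    """Wallace-rule Tm of the longest perfect duplex between p1 and p2."""
    duplex = longest_common_substring(p1.upper(), revcomp(p2))
    at = duplex.count("A") + duplex.count("T")
    gc = duplex.count("G") + duplex.count("C")
    return 2 * at + 4 * gc


def dimers_above_threshold(primers, threshold):
    """All primer combinations (including self-pairs) exceeding threshold."""
    return [(a, b) for a, b in combinations_with_replacement(primers, 2)
            if dimer_tm(a, b) > threshold]


if __name__ == "__main__":
    print(dimers_above_threshold(["AAAAGGGG", "CCCCTTTT", "ACACACAC"], 20))
```

Any primer pair containing a primer reported by
`dimers_above_threshold` would be dropped from the set.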

Some primer design tools disqualify a primer if, besides
the target sequence, a second identical sequence can be
found in the template. However, due to the higher
probability of a bisulfite primer to misprime on
non-identical bisulfite treated DNA, it is an embodiment
of this invention that only those primers are allowed to
be used in said amplification method for which no sequence
homology can be found, to the extent that even those
sequences that are different and/or mismatching in several
nucleotides are excluded. Applied strictly, however, this
would exclude primer molecules unnecessarily. Therefore
primers are only excluded if two primer molecules match
the template at a distance allowing for the amplification
of an unwanted product. This test is performed by means
such as, for example, electronic PCR. Electronic PCR
(e-PCR) is an in silico, virtual PCR carried out in order
to assess the suitability of primer molecules prior to in
vitro PCR. In the scope of this invention this testing
will be called 'virtual testing', and it will be referred
to as "virtually tested" or "virtually testing".
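
The principle of such a virtual test can be sketched as
follows. This is not the e-PCR program itself but a
minimal, exact-match illustration: for every ordered
combination of two primers it looks for a site identical
to the first primer followed, within a maximum product
length, by a site to which the second primer can anneal.
The function names and the 2000 bp default are assumptions
for illustration; mismatch tolerance is left out here.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq: str) -> str:
    """Reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def find_all(template: str, word: str):
    """Start positions of every (possibly overlapping) occurrence of word."""
    hits = []
    pos = template.find(word)
    while pos != -1:
        hits.append(pos)
        pos = template.find(word, pos + 1)
    return hits


def virtual_pcr(template: str, primers, max_product: int = 2000):
    """List (first primer, second primer, start, length) of virtual products.

    The template is one single strand written 5'->3'. A product needs a
    site identical to one primer and, downstream of it, a site that is
    the reverse complement of another (or the same) primer.
    """
    template = template.upper()
    products = []
    for first in primers:
        for second in primers:
            second_site = revcomp(second)
            for start in find_all(template, first.upper()):
                for site in find_all(template, second_site):
                    length = site + len(second_site) - start
                    if site >= start + len(first) and length <= max_product:
                        products.append((first, second, start, length))
    return products


if __name__ == "__main__":
    strand = "GG" + "ATGTTAGT" + "A" * 20 + "TTGAGGTA" + "GG"
    print(virtual_pcr(strand, ["ATGTTAGT", "TACCTCAA"]))
```

A primer combination is acceptable when the only product
reported across all templates is the intended amplificate.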

Thirdly, the primers used in step 3 of this invention are
characterized in that every possible combination of two
primer molecules in said reaction mixture does not lead
to the amplification of an additional unwanted product
when virtually testing for amplification using the
treated and the untreated nucleic acid sample as template,
even under conditions allowing for at least one base, but
not more than 20% of the total number of bases per
sequence, mismatching per primer. In the scope of this
invention a primer molecule is considered to bind to the
template whenever a template sequence exists that is
identical in at least 80% of its nucleotide sequence to
the target sequence the primer was originally designed
for. For example, a primer molecule of 50 nucleotides in
length is considered to still hybridize to a template
sequence that differs in fewer than 11 nucleotides (i.e.
is identical in at least 80% of its nucleotide sequence)
from the corresponding target sequence. If a match is
considered possible, it has to be tested whether this
match would lead to the amplification of an unwanted
product. This can be done with a program similar to e-PCR
(see below).
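
The 80% identity rule translates into a simple mismatch
budget per primer. The following sketch, with hypothetical
function names, computes that budget (10 mismatches for a
50-mer) and scans a template for every window that stays
within it. It compares the intended target site with the
template directly and ignores gaps.

```python
def max_mismatches(primer_length: int, percent: int = 20) -> int:
    """Largest number of mismatches still counted as binding."""
    # Integer arithmetic avoids floating point rounding (50 nt -> 10).
    return primer_length * percent // 100


def binding_sites(target: str, template: str, percent: int = 20):
    """(position, mismatches) of each template window within the budget.

    target is the sequence the primer was designed for; a window counts
    as a potential binding site if it differs from it in no more than
    max_mismatches(len(target), percent) positions.
    """
    target = target.upper()
    template = template.upper()
    limit = max_mismatches(len(target), percent)
    sites = []
    for start in range(len(template) - len(target) + 1):
        window = template[start:start + len(target)]
        mismatches = sum(1 for a, b in zip(target, window) if a != b)
        if mismatches <= limit:
            sites.append((start, mismatches))
    return sites


if __name__ == "__main__":
    template = "TTT" + "ACGTACGTAC" + "GGG" + "ACGAACGAAC" + "TTT"
    print(max_mismatches(50))
    print(binding_sites("ACGTACGTAC", template))
```

Each site found this way is only a candidate; whether it
matters is decided by the subsequent virtual amplification
test on pairs of primers.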

Especially preferred is an embodiment of said method
wherein the ability of said primer molecules to amplify
an unwanted product is tested by means of in silico PCR,
taking as template nucleic acid the coding strand of the
treated sample, the non-coding strand of the treated
sample and both of the strands of the untreated sample. It
is especially preferred to perform the virtual testing
with a tool like electronic PCR on the pretreated,
preferably bisulfite treated, template sequence consisting
of the treated sense and the treated anti-sense strand,
and on the unconverted template.

Furthermore it is preferred that this treatment is
bisulfite treatment and hence the nucleic acid template is
the bisulfite converted coding strand of the human genome,
the bisulfite converted non-coding strand of the human
genome and both of the strands of the untreated human
genome. Preferred is an embodiment of said method wherein
the ability of said primer molecules to amplify an
unwanted product is tested by means of electronic PCR,
thereby taking as template nucleic acid the bisulfite
converted coding strand of the human genome, the bisulfite
converted non-coding strand of the human genome and both
of the strands of the untreated human genome.

It is preferred that the number of mismatches allowed for
when virtually testing the amplification of unwanted
products according to step 3 c) of the invention is less
than 20% of the number of nucleotides of the primer.

It is also preferred that the number of mismatches
allowed for when virtually testing the amplification of
unwanted products according to step 3 c) of the invention
is less than 10% of the number of nucleotides of the
primer.

It is especially preferred that the number of mismatches
allowed for when virtually testing the amplification of
unwanted products according to step 3 c) of the invention
is less than 5% of the number of nucleotides of the
primer.

It is a preferred embodiment of this invention that the
number of mismatches allowed for when virtually testing
the amplification of unwanted products according to step
3 c) of the invention is less than seven.

It is especially preferred that the number of mismatches
allowed for when virtually testing the amplification of
unwanted products according to step 3 c) of the invention
is less than five.

It is another preferred embodiment of this invention that
the number of mismatches allowed for when virtually
testing the amplification of unwanted products according
to step 3 c) of the invention is less than three.

It is especially preferred in the scope of this invention
that the number of mismatches allowed for when virtually
testing the amplification of unwanted products according
to step 3 c) of the invention is one.

It is also included in the scope of this invention to
consider a primer molecule as being sufficiently similar
to bind to the template sequence if a template sequence
can be found that differs in the number of nucleotides but
is otherwise identical to the target sequence. When the
alignment of the primer and the template sequence leads to
a gap of up to 20% of the nucleotides of one sequence,
preferably of the primer sequence, this shall still be
considered sufficient for binding and hence potentially
leading to the amplification of an unwanted product.
Therefore these primers also need to be tested by means of
virtual PCR (for example with a program like e-PCR). Only
if this test reveals the virtual amplification of an
unwanted product caused by the combination of two primers
are the corresponding primer pairs excluded from the set
of selected pairs.

It is preferred that the number of nucleotides creating
one gap, in one of the sequences, when aligning the
primer molecule sequence with the template sequence,
allowed for when virtually testing the amplification of
unwanted products according to step 3 c) of the invention
is less than 20% of the number of nucleotides of the
primer molecule.

It is also preferred that the number of nucleotides
creating one gap, in one of the sequences, when aligning
the primer molecule sequence with the template sequence,
allowed for when virtually testing the amplification of
unwanted products according to step 3 c) of the invention
is less than 10% of the number of nucleotides of the
primer molecule.

It is preferred that the number of nucleotides creating
one gap, in one of the sequences, when aligning the
primer molecule sequence with the template sequence,
allowed for when virtually testing the amplification of
unwanted products according to step 3 c) of the invention
is less than 5% of the number of nucleotides of the
primer molecule.

Both of these situations, mismatching due to an
alternative nucleotide or non-matching due to a missing
nucleotide, are meant to be covered by the expression
describing those primer molecules that will eventually be
selected: "said primer molecules are characterized in that
every combination of two primer molecules, under
conditions allowing for one or more base mismatches per
primer, does not lead to the amplification of an unwanted
product when virtually tested using the treated and the
untreated sample nucleic acids as template".

It is also preferred that the primer molecules that
exceed a pre-specified melting temperature when binding
to the template have to be virtually tested for
amplification of unwanted products using the treated and
the untreated sample nucleic acids as template according
to step 3 c) of the method.

The basic problem of finding a primer specific enough to
give only one product on the low-complexity bisulfite DNA
is finally solved by testing each potential primer pair
for hybridization across the whole bisulfite converted
human genome. This requires translating the whole human
genome sequence information virtually into its bisulfite
treated version before performing a similarity search
against the primer pairs, which is based on a method like
the so-called e-PCR (Schuler G.D. (1997) Sequence Mapping
by electronic PCR. Genome Research 7(5): 541-550).
However, as the bisulfite conversion results in two
strands that are no longer complementary, this virtual
hybridization test needs to be done against both bisulfite
converted strands. In addition, in most cases the template
DNA is contaminated with unconverted genomic DNA. To also
exclude unwanted amplification with the unconverted DNA as
template, the same hybridization test has to be performed
a third time using the whole human genome sequence as a
template.
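
The virtual translation described above can be sketched as
follows. For simplicity the sketch assumes that every
cytosine is unmethylated unless its position is listed
explicitly, so that all other cytosines read as thymine
after treatment and amplification; this assumption, the
function names and the dictionary keys are illustrative
and not taken from the text.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq: str) -> str:
    """Reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def bisulfite_convert(strand: str, methylated=frozenset()) -> str:
    """Convert each C to T unless its index is in methylated."""
    return "".join(
        "T" if base == "C" and i not in methylated else base
        for i, base in enumerate(strand.upper())
    )


def virtual_templates(genome: str) -> dict:
    """The four single strands a primer pair has to be tested against."""
    top = genome.upper()
    bottom = revcomp(top)
    return {
        "bisulfite_top": bisulfite_convert(top),
        "bisulfite_bottom": bisulfite_convert(bottom),
        "untreated_top": top,
        "untreated_bottom": bottom,
    }


if __name__ == "__main__":
    for name, strand in virtual_templates("ACGTCC").items():
        print(name, strand)
```

Because each strand is converted separately, the two
converted strands are no longer reverse complements of
each other, which is why both have to be searched.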

Therefore it is a preferred embodiment of this invention
that the ability of said primer molecules to amplify an
unwanted product is tested by means such as electronic
PCR.

In the last step of the method said amplified target
nucleic acid is detected by any means standard to one
skilled in the art.

In a preferred embodiment of this method the set of
primer molecules is comprised of at least two primer
molecules but not more than 64 primer molecules, given
the number is a multiple of 2; in other words, the set is
comprised of 1-32 primer pairs.

In another preferred embodiment of this method the set of
primer molecules is comprised of between 2 and 32 primer
molecules, given the number is a multiple of 2; in other
words, the set is comprised of 1-16 primer pairs.

In a preferred embodiment of this method, said primer
molecule comprises at least one nucleotide within the
last three nucleotides from the 3' end of the molecule,
wherein said nucleotide is complementary to a nucleotide
of the target sequence that, as a result of the treatment
performed in step 2 of the invention, changed its
hybridization behavior.

It is a preferred embodiment of this method that said
primer molecule comprises at least one nucleotide within
the last three nucleotides from its 3' end that is
complementary to a nucleotide of the target sequence that
was converted by the treatment performed in step 2 of the
method to another base exhibiting an alternative base
pairing behavior.

In an especially preferred embodiment said nucleotide is
a cytosine prior to the treatment that converts
unmethylated cytosines. In a preferred embodiment said
treatment is bisulfite treatment. Said primer molecule
comprises at least one nucleotide within the last three
nucleotides from the 3' end of the molecule, wherein said
nucleotide is complementary to a cytosine that was
converted by bisulfite treatment to another base
exhibiting the base pairing behavior of thymine.

This is to exclude binding of said primer molecules to
the remaining untreated or insufficiently treated nucleic
acids, which might still serve as template nucleic acid in
the PCR.

Furthermore it is a preferred embodiment of this
invention that said primer molecules do not form loops or
hairpins on their own or with each other.

In another preferred embodiment of the method said primer
molecules do not form dimers with each other.

In the text the word 'hairpin' is taken to mean a
secondary structure formed by a primer molecule when the
3' terminal region of said nucleic acid hybridizes to the
5' terminal region of said nucleic acid, forming a double
stranded stem structure, and wherein only the central
region of the primer is single stranded.

As described in the text, the word 'loop' refers to a
secondary structure formed by a primer molecule when two
or more nucleotides of said molecule hybridize, thereby
forming a secondary structure comprising a double stranded
structure one or more base pairs in length and further
comprising a single stranded region between said double
stranded regions.

The binding of a primer molecule's 3' end to any part of
a second primer molecule in the set needs to be avoided.
Otherwise the polymerase would extend the first primer
using the second primer as template, which would lead to
a new unwanted product, an extended primer, or rather a
primer hybrid, which would serve as the preferred
template for the next round of the polymerase chain
reaction and thereby prevent a sufficient amplification of
the wanted product.

Therefore it is another preferred embodiment of this
method that each of said primer molecules is
characterized in that at least the last 5 bases at the 3'
end of said primer molecule are not complementary to the
sequence of any other primer molecule in the set.
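
This criterion lends itself to a direct check. The sketch
below, using hypothetical function names, tests whether the
last five bases at the 3' end of one primer are perfectly
complementary to any stretch of another primer in the set;
partial complementarity is not considered.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq: str) -> str:
    """Reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def three_prime_end_binds(primer: str, other: str, n: int = 5) -> bool:
    """True if the last n bases of primer can pair with a stretch of other."""
    tail = primer.upper()[-n:]
    # The tail pairs with other wherever other contains its reverse complement.
    return revcomp(tail) in other.upper()


def three_prime_conflicts(primers, n: int = 5):
    """Ordered pairs (primer, other) in which primer's 3' end binds other."""
    return [(a, b)
            for i, a in enumerate(primers)
            for j, b in enumerate(primers)
            if i != j and three_prime_end_binds(a, b, n)]


if __name__ == "__main__":
    print(three_prime_conflicts(["TTTTTACGGA", "GGTCCGTAA", "GGGGGGGG"]))
```

A primer appearing as the first element of any reported
pair could be extended on the other primer and would
therefore be rejected.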

It is also preferred that said primer molecules do not
bind to nucleic acids which prior to the treatment of step
2 contained a 5'-CG-3' site. Such a site would make the
binding of the primers to bisulfite treated nucleic acids
depend specifically on the methylation status of the
cytosine. A primer corresponding to CG would bind to the
treated methylated version only, whereas a primer
corresponding to TG would bind to the treated unmethylated
version of these nucleic acids only. It is therefore
preferred that said primer molecules do not contain
nucleic acid sequences complementary or identical to
nucleic acid sequences which prior to the treatment of
step 2 contained a 5'-CG-3' site.
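
Whether a candidate binding site is free of 5'-CG-3' sites
can be checked on the untreated sequence. The sketch below
is conservative in that it also rejects a site when a CG
dinucleotide merely straddles one of its ends; this
extension by one base on each side, like the function name,
is an assumption made for illustration.

```python
def site_overlaps_cpg(untreated: str, start: int, length: int) -> bool:
    """True if the binding site touches a 5'-CG-3' dinucleotide.

    untreated is the sequence before treatment, start and length
    describe the primer binding site on it (0-based start).
    """
    untreated = untreated.upper()
    # Widen the window by one base per side to catch a CG that
    # straddles the edge of the binding site.
    window = untreated[max(0, start - 1):start + length + 1]
    return "CG" in window


if __name__ == "__main__":
    genomic = "AATTCGAATTGG"
    for start in (0, 1, 5, 6):
        print(start, site_overlaps_cpg(genomic, start, 4))
```

Since CG reads the same on both strands, checking one
strand of the untreated sequence covers primers designed
against either converted strand.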

In a preferred embodiment of this method said primer
molecules are of a specified size range.

It is especially preferred that these primers are
comprised of 16-50 nucleotides.

In a preferred embodiment of this method said primer
molecules do not comprise sequences that are
complementary to regions of the target nucleic acids that
contained specified restriction enzyme recognition sites
prior to the treatment that altered the base pairing
behavior of the unmethylated cytosines. It is preferred
that said primers are complementary to target sequences
which prior to the treatment performed in step 2 of the
invention did not contain specified restriction enzyme
recognition sites.

By selecting the right primer molecules, the sequence of
the amplificate is also determined. It therefore has to be
ensured that only those primer molecules are used that
lead to amplification of nucleic acids containing a
reasonably high number of CpG sites to be analyzed. Due
to the treatment of step 2 of this invention these CpG
sites, depending on the methylation status of the
cytosine, are converted and will therefore appear either
as CG dinucleotides or as TG dinucleotides in the
amplificate.

It is preferred that said primer molecules amplify
regions of nucleic acids that prior to bisulfite treatment
comprise more than eight 5'-CG-3' sites, also referred to
as CG dinucleotides.

It is also preferred that said primer molecules amplify
regions of nucleic acids that prior to bisulfite
treatment comprise more than six 5'-CG-3' sites, also
referred to as CG dinucleotides.

It is also preferred that said primer molecules amplify
regions of nucleic acids that prior to bisulfite
treatment comprise more than four 5'-CG-3' sites, also
referred to as CG dinucleotides, and finally it is
especially preferred that said primer molecules amplify
regions of nucleic acids that prior to bisulfite treatment
comprise more than two 5'-CG-3' sites, also referred to as
CG dinucleotides.

Said primer molecules lead to amplificates within a
specified size range.

It is a preferred embodiment of this method that said
primer molecules lead to amplificates which are comprised
of at least 50 bp but not more than 2000 bp.

Especially preferred are primer molecules that lead to
amplificates which are comprised of at least 80 bp but
not more than 1000 bp.
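
The CG content and size preferences for the amplificate
can be combined into one filter. In the following sketch
the function names and default values are illustrative
assumptions; the defaults mirror the broadest ranges named
above (more than two CG dinucleotides before treatment, 50
to 2000 bp).

```python
def count_cpg(untreated: str) -> int:
    """Number of 5'-CG-3' dinucleotides in the untreated sequence."""
    # "CG" cannot overlap itself, so a plain count is exact.
    return untreated.upper().count("CG")


def amplificate_acceptable(untreated: str,
                           more_than_cpg: int = 2,
                           min_len: int = 50,
                           max_len: int = 2000) -> bool:
    """True if the region has enough CG sites and a permitted length."""
    in_size_range = min_len <= len(untreated) <= max_len
    return in_size_range and count_cpg(untreated) > more_than_cpg


if __name__ == "__main__":
    region = "CG" * 3 + "A" * 50
    print(count_cpg(region), amplificate_acceptable(region))
    print(amplificate_acceptable(region, more_than_cpg=8))
```

Stricter embodiments correspond to other arguments, for
example `more_than_cpg=8` with `min_len=80` and
`max_len=1000`.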

Furthermore a method is preferred wherein said primer
molecules lead to amplificates of treated nucleic acids
which prior to the treatment that altered the base pairing
behavior of the unmethylated cytosines did not contain
restriction enzyme recognition sites. Said primer
molecules lead to amplificates that are amplified regions
of the treated nucleic acids which prior to the treatment
performed in step 2 of the method did not contain
specified restriction enzyme recognition sites.

A further subject of this invention is a method for
producing said primer molecules. The main step of
producing a primer molecule is determining its sequence.
In the following the phrase "primer design" will be used
instead of primer production whenever the step of
determining said specific primer sequences is referred to.
Designing primer molecules is a process which as such is
well known to scientists skilled in the art. The programs
usually used for this purpose are, for example, PRIMER3 or
OSP (Rozen S and Skaletsky H (2000) PRIMER3 on the WWW for
general users and for biologist programmers. Methods Mol
Biol 132: 365-386; Hillier L and Green P (1991) OSP: A
computer program for choosing PCR and DNA sequencing
primers. PCR Methods and Applications 1: 124-128). Other
primer design systems (as described in EP-A 1136932) are
often based on those commonly known programs.

An embodiment of this invention takes advantage of using
a program like PRIMER3 first and then adding a number of
steps that finally result in an advanced method of
designing primers that are specifically useful for
amplifying sequences of low complexity.

In the first step of this method for designing specific
primer molecules for nucleic acids of low complexity,
primer pairs that amplify single products are selected by
applying standard tools of primer design known in the
art, for example the program PRIMER3 (Rozen, S and
Skaletsky, H (2000) Methods Mol Biol 132: 365-386).

In the second step of the method said primer pairs are
tested as to whether one of their primer molecules, when
hybridizing to any other primer molecule in the set,
exceeds a specified threshold melting temperature TM. If
this is the case, the primer pair that comprises said
primer is excluded from the set of potentially combined
pairs.

In the third step of the method the number of previously
selected primer pairs is reduced to a smaller number by
implementing as a new criterion a measure of the
complexity of the primer sequence. Primer pairs that
contain a primer molecule which does not meet said
criterion are excluded.

The basic problem of finding a primer specific enough to
give only one product on the low-complexity bisulfite
DNA is finally solved by testing each potential primer
pair for hybridization across the whole bisulfite
converted human genome. This requires translating the whole
human genome sequence information virtually ("in silico")
into its treated, for example bisulfite treated, version
before performing a similarity search against the primer
pairs, which is based on a method like the so-called e-PCR
(Schuler G.D. (1997) Sequence Mapping by electronic PCR.
Genome Research 7(5): 541-550). However, as the bisulfite
conversion results in two different versions of the double
helix whose sense and anti-sense strands are no longer
mutually complementary, this in silico amplification needs
to be performed on both bisulfite converted versions of the
genome. In addition, in most cases the template DNA is
contaminated with unconverted genomic DNA. It cannot be
excluded that single cytosines or longer runs of DNA remain
unconverted or are only converted incompletely by the
bisulfite treatment. To also exclude unwanted amplification
of the unconverted DNA as template, the same hybridization
test has to be performed a third time using the whole human
genome sequence as a template.
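
The in silico translation described above can be sketched in
a few lines of Python. This is only an illustration under
stated assumptions: every cytosine is taken to be converted
unless the hypothetical flag keep_cpg_methylated is set, and
the function names are invented for this sketch rather than
taken from any existing tool.

```python
def reverse_complement(seq: str) -> str:
    """Reverse complement of an A/C/G/T sequence."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def bisulfite_convert(strand: str, keep_cpg_methylated: bool = False) -> str:
    """Convert C to T on one strand, read 5'->3'.

    Assumption: with keep_cpg_methylated=True, cytosines in a CpG context
    are treated as methylated and therefore left unconverted.
    """
    strand = strand.upper()
    out = []
    for i, base in enumerate(strand):
        in_cpg = base == "C" and strand[i + 1:i + 2] == "G"
        if base == "C" and not (keep_cpg_methylated and in_cpg):
            out.append("T")
        else:
            out.append(base)
    return "".join(out)


def in_silico_templates(genomic_sense: str) -> dict[str, str]:
    """Build the three templates that the hybridization test is run on."""
    sense = genomic_sense.upper()
    antisense = reverse_complement(sense)
    return {
        # the two strands are converted independently and are
        # no longer complementary to each other afterwards
        "bisulfite_sense": bisulfite_convert(sense),
        "bisulfite_antisense": bisulfite_convert(antisense),
        # unconverted DNA may contaminate the template
        "untreated": sense,
    }
```

For the toy sequence AACCGG the converted sense strand is
AATTGG and the converted anti-sense strand is TTGGTT, which
are not reverse complements of each other.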

As this requires considerable effort and time (CPU time),
it is the fourth and last step of this design method, which
is completed prior to the final testing in a "wet",
lab-based experiment.

In addition, to improve the specificity of said primer
molecules, the stringency of the selection criteria is
increased. Some standard primer design tools disqualify a
primer if a second identical sequence, besides the target
sequence, can be found in the template sequence. That way,
mispriming at rather stringent hybridization conditions is
avoided. This mispriming would not necessarily lead to an
additional unwanted product, but would lead to the dilution
of the primer molecules available for amplification. This
selection has been performed in step one already (for
example by PRIMER3). However, due to the higher probability
of a bisulfite primer molecule to mismatch with
non-identical bisulfite treated DNA, there is still a
chance for said primer molecules to misprime even when up
to 20% of the nucleotides of the primer sequence differ.
Therefore it is claimed in this invention to only use
primer molecules for which not even a weak sequence
homology can be found. However, this would exclude primer
molecules unnecessarily. Therefore they are only excluded
if two primer molecules match the template and amplify an
unwanted product. This test is performed by means such as,
for example, Electronic PCR. Electronic PCR (e-PCR) is an
in silico virtual PCR carried out in order to assess the
suitability of primers prior to in vitro PCR.

In the fourth step of the method for designing these
primers, it is therefore tested whether there are any
regions of the template nucleic acid, said template being
comprised of the sense and the anti-sense strands of the
treated and the untreated nucleic acids, that are more than
80% identical in sequence to the primer molecule, and
whether those primer molecules are able to amplify an
unwanted product. If this is the case, the primer pair
comprising said primer molecule is excluded from the
selection.

The template nucleic acid is comprised of the treated
template nucleic acid and the untreated template nucleic
acid. The treated nucleic acid is itself comprised of two
strands which, after treatment, are no longer complementary
to each other. This virtual testing can, for example, be
performed as described by Gregory Schuler in his article
(cited above) about sequence mapping by "Electronic PCR".
The primer pairs remaining can be used to specifically
amplify regions of nucleic acids of low complexity, which
is the aim of this invention. Hence step 4 of the design
method is the virtual testing of each possible primer pair
combination, under pre-specified conditions at a stringency
allowing for one or more base pair mismatches, as to
whether any unwanted nucleic acids are amplified. Said
virtual testing is carried out upon both untreated and
treated nucleic acids. The wording "possible combinations"
refers to all combinations that are possible within a set
of primer pairs to be used in one amplification reaction
vessel.
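
The virtual testing of all primer combinations can be
illustrated by the following simplified sketch. It is not
the e-PCR program cited above: it is a naive mismatch-
tolerant scan of a single template strand, and the function
names, the default product size window and the mismatch
limit are assumptions chosen for illustration.

```python
def reverse_complement(seq: str) -> str:
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def hamming_hits(template: str, query: str, max_mismatches: int) -> list[int]:
    """Start positions where query matches template with few mismatches."""
    hits = []
    n = len(query)
    for start in range(len(template) - n + 1):
        mismatches = 0
        for a, b in zip(template[start:start + n], query):
            if a != b:
                mismatches += 1
                if mismatches > max_mismatches:
                    break
        else:  # loop finished without exceeding the mismatch limit
            hits.append(start)
    return hits


def virtual_pcr(template, primers, max_mismatches=2, min_len=100, max_len=500):
    """Predict products for every ordered combination of two primers.

    Returns tuples (forward primer, reverse primer, start, product length).
    The first primer is searched as written, the second one as its
    reverse complement, on the given template strand.
    """
    products = []
    for fwd in primers:
        fwd_sites = hamming_hits(template, fwd, max_mismatches)
        for rev in primers:
            rev_sites = hamming_hits(template, reverse_complement(rev),
                                     max_mismatches)
            for f in fwd_sites:
                for r in rev_sites:
                    length = r + len(rev) - f
                    if min_len <= length <= max_len:
                        products.append((fwd, rev, f, length))
    return products
```

In such a sketch the scan would be repeated for each of the
treated and untreated templates, and a primer pair would be
excluded whenever a product other than the intended
amplificate is reported.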

In a preferred embodiment, an additional step is added
following the virtual testing, in which all those pairs
that remained are tested in a lab-based single PCR assay as
to whether the desired amplificate can be obtained. If that
is the case, the chosen pairs can be used to specifically
amplify those regions of nucleic acids of low complexity
according to the method described before.

In a specially preferred embodiment, the first step of the
design method is characterized as selecting a pool of
possible primer pairs per amplificate by means of a
standard PCR primer design program, using as template said
nucleic acids that have been masked for repeats and SNPs,
and considering the following factors:

length of amplificate, length of primer, melting
temperature of the primer molecule, dimer formation
parameters, loop formation parameters, exclusion of
unidentified or ambiguous nucleotides in the primer
sequence, exclusion of restriction enzyme recognition
sites.

In a preferred embodiment of this invention, this measure
of complexity is a measure of linguistic complexity as
defined by Bolshoy et al. (see above). Those primer pairs
which comprise a primer that does not reach a set level of
this linguistic complexity are excluded from the previously
selected ones.

In another preferred embodiment of this invention, this
measure of complexity is a measure of Shannon entropy (as
described before).
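
As an illustration of a complexity filter of this kind, the
following sketch computes the Shannon entropy of the k-mer
composition of a primer. The choice of k, the entropy
threshold and the function names are assumptions made for
this example; the exact definition used for the invention is
the one given earlier in the description.

```python
import math
from collections import Counter


def shannon_entropy(seq: str, k: int = 1) -> float:
    """Shannon entropy (in bits) of the k-mer frequencies of seq."""
    seq = seq.upper()
    kmers = [seq[i:i + k] for i in range(len(seq) - k + 1)]
    total = len(kmers)
    counts = Counter(kmers)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def pair_passes(pair: tuple[str, str], min_entropy: float) -> bool:
    """A pair is kept only if both of its primers reach the threshold."""
    return all(shannon_entropy(primer) >= min_entropy for primer in pair)
```

A primer consisting of a single repeated base has an entropy
of 0 bits, while a primer using all four bases equally often
reaches the maximum of 2 bits for k = 1.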

In an especially preferred embodiment of this design
method, prior to performing step d), the additional step is
carried out of excluding from the remaining primer pairs
those which contain a primer molecule that comprises at
least one CpG site.

In an especially preferred embodiment of this method
according to the design of said primers, prior to
performing step d), the additional step is carried out of
excluding primer pairs from the remaining pairs when one of
their primer molecules does not contain, within the last
three nucleotides from the 3' end of the molecule, at least
one nucleotide that is complementary to a nucleotide of the
target sequence that was converted to a different
nucleotide by bisulfite treatment.


In an especially preferred embodiment of this method
according to the production of said primers, prior to
performing step d), the additional step is carried out of
excluding from the remaining primer pairs those which
amplify a nucleic acid that did not, prior to treatment
with bisulfite, contain a minimum of two CpG sites.
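
The CpG-related exclusion steps can be pictured with the
following minimal sketch. It assumes that the untreated
genomic sequence of the amplified region and of both primer
binding sites is available; the function names are
hypothetical.

```python
def count_cpg(genomic_seq: str) -> int:
    """Number of CpG dinucleotides in an untreated genomic sequence."""
    return genomic_seq.upper().count("CG")


def keep_pair(amplicon_genomic: str, fwd_site: str, rev_site: str,
              min_cpg: int = 2) -> bool:
    """Apply both CpG criteria to one primer pair.

    fwd_site and rev_site are the genomic sequences covered by the
    two primers (an assumption of this sketch).
    """
    # a primer must not cover a CpG site
    if count_cpg(fwd_site) > 0 or count_cpg(rev_site) > 0:
        return False
    # the amplified region must have contained enough CpG sites
    return count_cpg(amplicon_genomic) >= min_cpg
```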

In an especially preferred embodiment of this method
according to the production of said primers, prior to
performing step d), the additional step is carried out of
excluding primer pairs from the remaining primer pairs when
one of their primer molecules contains more than 5 bases at
its 3' end that are complementary to any other primer
molecule's sequence in the set.
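
A possible way to check this 3' end criterion is sketched
below. The sketch assumes that "more than 5 bases" means a
perfectly complementary, ungapped stretch of at least six
bases ending at the 3' end; the function names are
illustrative only.

```python
def reverse_complement(seq: str) -> str:
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def three_prime_clash(primer: str, others: list[str],
                      max_complementary: int = 5) -> bool:
    """True if the 3' end of primer can anneal to one of the other primers."""
    tail = primer.upper()[-(max_complementary + 1):]
    # strands pair antiparallel, so look for the reverse complement
    target = reverse_complement(tail)
    return any(target in other.upper() for other in others)


def drop_clashing_pairs(pairs, max_complementary=5):
    """Remove every pair in which one primer clashes with another primer."""
    kept = []
    for i, (fwd, rev) in enumerate(pairs):
        rest = [p for j, pair in enumerate(pairs) if j != i for p in pair]
        if (three_prime_clash(fwd, rest + [rev], max_complementary)
                or three_prime_clash(rev, rest + [fwd], max_complementary)):
            continue
        kept.append((fwd, rev))
    return kept
```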



In an especially preferred embodiment of this method
according to the production of said primers, prior to
performing step d), the additional step is carried out of
excluding from the remaining primer pairs those pairs which
comprise a primer molecule that, in combination with
another primer molecule in the set, amplifies an unwanted
product when virtually tested according to step 3 c) of the
amplification method under conditions allowing for a number
of mismatching nucleotides of 20% of the number of
nucleotides of the primer molecule.

In an especially preferred embodiment of this method
according to the production of said primers, prior to
performing step d), the additional step is carried out of
excluding from the remaining primer pairs those pairs which
comprise a primer molecule that, in combination with
another primer molecule in the set, amplifies an unwanted
product when virtually tested according to step 3 c) of the
amplification method under conditions allowing for a number
of nucleotides creating one gap, when aligning the primer
molecule sequence with the template sequence, of up to 20%
of the number of nucleotides of the primer molecule.

In an especially preferred embodiment of this method
according to the production of said primers, prior to
performing step d), the additional step is carried out of
excluding from the remaining primer pairs those pairs which
comprise a primer molecule that, in combination with
another primer molecule in the set, amplifies an unwanted
product when virtually tested according to step 3 c) of the
amplification method under conditions allowing for four or
fewer mismatching base pairs.

In an especially preferred embodiment of this method
according to the production of said primers, prior to
performing step d), the additional step is carried out of
excluding from the remaining primer pairs those pairs which
comprise a primer molecule that, in combination with
another primer molecule in the set, amplifies an unwanted
product when virtually tested according to step 3 c) of the
amplification method under conditions allowing for two or
fewer mismatching base pairs.

The following example is intended to illustrate the
invention:

Example

Here we present experimental data showing that multiplex
PCRs designed with a tool according to this invention are
more successful than multiplex PCRs not designed in this
manner.

It is the aim of the experiment to amplify 40 different
nucleic acids. The genomic regions of interest are given
in the sequence protocol (SEQ ID 41-80). These genomic
sequences were translated into their bisulfite converted
versions and served as templates for amplification of
specific regions with the primer sequences described as
follows.

Primer molecule pairs used for single PCRs were originally
designed with the use of the standard primer design
program PRIMER3 (as mentioned in the description). The
criteria used in that step will not be discussed in detail.
This selection, however, provides several possible primer
pairs per amplificate. Following the present invention,
these primer pairs were selected further according to the
following criteria:

• The restriction enzyme recognition site to be excluded
  from the genomic nucleic acid (which subsequent to
  bisulfite conversion becomes the template for the PCR
  amplification step) is: GTTTAAAC.

• The minimum length of the primer molecule is 18
  nucleotides. The maximum length is 27 nucleotides.
  Ideally the primer consists of 22 nucleotides.

• The minimum required measure of linguistic complexity
  is 0.2.

• The minimum melting temperature of a primer molecule
  is 54°C and the maximum melting temperature is 57°C.
  The ideal melting temperature, however, is 55°C.

• The minimum length of an amplificate is 100 bp and
  the maximum length is 500 bp.

• The minimum number of CpG sites that were present,
  prior to bisulfite treatment, in the region of the
  nucleic acid that was amplified is 4.

• The number of mismatch bases allowed for when
  virtually testing the primer pairs according to the
  invention for amplification of an unwanted product
  with the help of e-PCR (Electronic PCR) is 2.
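
The numerical criteria listed above can be collected in a
small configuration object, as in the following sketch. The
class and function names are hypothetical, and the melting
temperature and linguistic complexity are assumed to be
computed elsewhere and passed in as numbers.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Criteria:
    """Selection criteria of the example (values taken from the list above)."""
    restriction_site: str = "GTTTAAAC"
    min_primer_len: int = 18
    max_primer_len: int = 27
    min_complexity: float = 0.2
    min_tm: float = 54.0
    max_tm: float = 57.0
    min_amplicon_len: int = 100
    max_amplicon_len: int = 500
    min_cpg: int = 4
    max_epcr_mismatches: int = 2


def primer_ok(length: int, tm: float, complexity: float,
              c: Criteria = Criteria()) -> bool:
    """Check length, melting temperature and complexity of one primer."""
    return (c.min_primer_len <= length <= c.max_primer_len
            and c.min_tm <= tm <= c.max_tm
            and complexity >= c.min_complexity)


def amplicon_ok(genomic_region: str, c: Criteria = Criteria()) -> bool:
    """Check the untreated genomic sequence of the amplified region."""
    region = genomic_region.upper()
    return (c.min_amplicon_len <= len(region) <= c.max_amplicon_len
            and c.restriction_site not in region
            and region.count("CG") >= c.min_cpg)
```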

The use of this invention, that is, the use of the design
method being the subject of the invention and/or performing
the steps of said method as described above (assuming a set
size of 1), leads to the selection of the following 40
optimized primer molecule pairs:



TABLE 1:

Columns: primer sequence; identifier (number indicating the
amplificate); SEQ ID; primer direction; starting position of
the primer in the bisulfite converted sequence of the ROI.

primer sequence             identifier  SEQ ID  direction  start
AATCCTCCAAATTCTAAAAACA      2025        81      0          1816
AGGAAAGGGAGTGAGAAAAT        2025        82      1          2138
GGATAGGAGTTGGGATTAAGAT      2044        83      0          2070
AAATCTTTTTCAACACCAAAAT      2044        84      1          2483
AACCCTTTCTTCAAATTACAAA      2045        85      0          1340
TGATTGGGTTTTAGGGAAATA       2045        86      1          1687
TTGAAAATAAGAAAGGTTGAGG      2106        87      0          1481
CTTCTACCCCAAATCCCTA         2106        88      1          1764
TGTTTGGGATTGGGTAGG          2166        89      0          2226
CATAACCTTTACCTATCTCCTCA     2166        90      1          2437
TTTTAGATTGAGGTTTTAGGGT      2188        91      0          101
ATCCATTCTACCTCCTTTTTCT      2188        92      1          598
GGAGGGGAGAGGGTTATG          2191        93      0          133
TACTATACACACCCCAAAACAA      2191        94      1          506
TTTTGGGAATGGGTTGTAT         2194        95      0          1628
CTACCCTTAACCTCCATCCTA       2194        96      1          1996
TTGTTGGGAGTTTTTAAGTTTT      2212        97      0          1711
CAAATTCTCCTTCCAAATAAAT      2212        98      1          2063
GTAATTTGAAGAAAGTTGAGGG      2267        99      0          1709
CCAACAACTAAACAAAACCTCT      2267        100     1          2004
GGAGTTGTATTGTTGGGAGA        2317        101     0          1110
TAAAACCCCAATTTTCACTAA       2317        102     1          1388
TTTGTATTAGGTTGGAAGTGGT      2383        103     0          1
CCCAAATAAATCAACAACAACA      2383        104     1          285
GATTTTTGGAGAGGAAGTTAAG      2387        105     0          789
AAAACTAAAAACCAAACCCATA      2387        106     1          1169
TGGGGTTAGTTTAGGATAGG        2391        107     0          1353
CTTAAAAACACTAAAACTTCTCAAA   2391        108     1          1750
TTTTTGTATTGGGGTAGGTTT       2395        109     0          547
CCCAACTATCTCTCTCCTCTATAA    2395        110     1          1094
ATTAGAAGTGAAAGTAATGGAATTT   2401        111     0          381
TCAATTTCCAAAAACCAAC         2401        112     1          795
GGGATGGGTTATTAGTTGTAAA      2453        113     0          1867
CCTTCACACAAAACTACAAAAA      2453        114     1          2139
TAATTGAAGGGGTTAATAGTGG      2484        115     0          1861
AAAACCAAAACCAAAACTAAAA      2484        116     1          2252
AGTGGATTTGGAGTTTAGATGT      2512        117     0          1016
AACAAAATAAAAACTTCTCCCA      2512        118     1          1446
TAGGGGAAAAGTTAGAGTTGAG      2741        119     0          1413
CCCATTAACCCACAAAAA          2741        120     1          1888
ATTTTAGTTTGTGAAATGGGAT      2745        121     0          1685
TCTTAACCAATAACCCCTCAC       2745        122     1          2097
GTGGGTTTTGGGTAGTTATAGA      2746        123     0          1679
TAACCTCCTCTCCTTACCAA        2746        124     1          2163
TAGGATGGGGAGAGTAATGTTT      2747        125     0          972
ACAACTTATCCAACTTCCATTC      2747        126     1          1448
TCCCACAAAAACTAAACAATTA      2749        127     0          1370
AGGTTTTAGATGAAGGGGTTT       2749        128     1          1789
TTTGGAGGGTTTAGTAGAAGTTA     2751        129     0          88
CCCAATAATCACAAAATAAACA      2751        130     1          567
ATACAACCTCAAATCCTATCCA      2752        131     0          228
AGGGAGAAGGAAGTTATTTGTT      2752        132     1          712
GGAAGATGAGGAAGTTGATTAG      2755        133     0          1000
CCTACAACCCTATCCTCTAAAA      2755        134     1          1371
TTAGTAGGGGTGTGAGTGTTTT      2831        135     0          1313
CAAACAAAACTTCTATCTCAACC     2831        136     1          1499
TTATAGGGTTGAGTTTGGGAT       2850        137     0          2100
TAAACAAACAACAAATCTTCCA      2850        138     1          2400
TGAAAATGAAGGTATGGAGTTT      2852        139     0          1262
TTAAAACCATATAATCCCTCCA      2852        140     1          1583
TATGTTTGGTTTTGTTTTGAGA      2859        141     0          1093
AACCCCATCACTTTTATTTCTT      2859        142     1          1491
GGGTGTAGAAGTGTTTAGGTTT      2861        143     0          2385
TTTCTCCCCTTACAACAATAAC      2861        144     1          2732
TCCCCTTCCAACTATATCTCTC      2864        145     0          884
TGAGAGTGTTTTAGGGAAGTTT      2864        146     1          1175
AAAACCAAAACATAAACCAAAA      2867        147     0          1312
GATTAGGAGGGTTTGTTGAGAT      2867        148     1          1701
AATGGTTGATGATTTTGGTTT       2961        149     0          2039
ACTCTCTTCCCTATACCCCTAA      2961        150     1          2311
AGTTAGAAGAGGAGTTAGGATGG     3511        151     0          1340
TAATTTTCCAATACCCATTTTC      3511        152     1          1711
TGTTAGTAGAGTTTTAGGGAGGTT    3532        153     0          1135
ACACTACCTATCCTTACCCCAC      3532        154     1          1592
TTTTTGTTTTTATGGGGTGTAT      3534        155     0          1909
TTAAATATCCCTTCCTTAACCA      3534        156     1          2385
TGGGTAGTATTTTTGTTGGTTT      3538        157     0          956
CCTAAAAACTCTCTCATCCTCA      3538        158     1          1414
AGTGGTTTAGGAGTATTTGGTTA     3540        159     0          659
AACTCCCTCCATCTACAATATC      3540        160     1          1064



These primer pairs lead to the amplification of specific
regions (amplificates, SEQ IDs 1-40) of the bisulfite
converted sequences of the genomic ROIs (SEQ IDs 41-80)
of interest. The ROIs can be identified by the four-digit
number that specifies the ROI and the corresponding
amplificate, as indicated in the following table.

TABLE 2:

SEQ ID  Class        Identifier  Kind of DNA         SEQ ID  Class  Identifier  Kind of DNA
1       amplificate  2025        bisulfite sequence  41      ROI    2025        genomic sequence
2       amplificate  2044        bisulfite sequence  42      ROI    2044        genomic sequence
3       amplificate  2045        bisulfite sequence  43      ROI    2045        genomic sequence
4       amplificate  2106        bisulfite sequence  44      ROI    2106        genomic sequence
5       amplificate  2166        bisulfite sequence  45      ROI    2166        genomic sequence
6       amplificate  2188        bisulfite sequence  46      ROI    2188        genomic sequence
7       amplificate  2191        bisulfite sequence  47      ROI    2191        genomic sequence
8       amplificate  2194        bisulfite sequence  48      ROI    2194        genomic sequence
9       amplificate  2212        bisulfite sequence  49      ROI    2212        genomic sequence
10      amplificate  2267        bisulfite sequence  50      ROI    2267        genomic sequence
11      amplificate  2317        bisulfite sequence  51      ROI    2317        genomic sequence
12      amplificate  2383        bisulfite sequence  52      ROI    2383        genomic sequence
13      amplificate  2387        bisulfite sequence  53      ROI    2387        genomic sequence
14      amplificate  2391        bisulfite sequence  54      ROI    2391        genomic sequence
15      amplificate  2395        bisulfite sequence  55      ROI    2395        genomic sequence
16      amplificate  2401        bisulfite sequence  56      ROI    2401        genomic sequence
17      amplificate  2453        bisulfite sequence  57      ROI    2453        genomic sequence
18      amplificate  2484        bisulfite sequence  58      ROI    2484        genomic sequence
19      amplificate  2512        bisulfite sequence  59      ROI    2512        genomic sequence
20      amplificate  2741        bisulfite sequence  60      ROI    2741        genomic sequence
21      amplificate  2745        bisulfite sequence  61      ROI    2745        genomic sequence
22      amplificate  2746        bisulfite sequence  62      ROI    2746        genomic sequence
23      amplificate  2747        bisulfite sequence  63      ROI    2747        genomic sequence
24      amplificate  2749        bisulfite sequence  64      ROI    2749        genomic sequence
25      amplificate  2751        bisulfite sequence  65      ROI    2751        genomic sequence
26      amplificate  2752        bisulfite sequence  66      ROI    2752        genomic sequence
27      amplificate  2755        bisulfite sequence  67      ROI    2755        genomic sequence
28      amplificate  2831        bisulfite sequence  68      ROI    2831        genomic sequence
29      amplificate  2850        bisulfite sequence  69      ROI    2850        genomic sequence
30      amplificate  2852        bisulfite sequence  70      ROI    2852        genomic sequence
31      amplificate  2859        bisulfite sequence  71      ROI    2859        genomic sequence
32      amplificate  2861        bisulfite sequence  72      ROI    2861        genomic sequence
33      amplificate  2864        bisulfite sequence  73      ROI    2864        genomic sequence
34      amplificate  2867        bisulfite sequence  74      ROI    2867        genomic sequence
35      amplificate  2961        bisulfite sequence  75      ROI    2961        genomic sequence
36      amplificate  3511        bisulfite sequence  76      ROI    3511        genomic sequence
37      amplificate  3532        bisulfite sequence  77      ROI    3532        genomic sequence
38      amplificate  3534        bisulfite sequence  78      ROI    3534        genomic sequence
39      amplificate  3538        bisulfite sequence  79      ROI    3538        genomic sequence
40      amplificate  3540        bisulfite sequence  80      ROI    3540        genomic sequence



The second task in this example is to select from these
40 primer pairs those pairs which can be combined in five
multiplex PCRs to amplify eight targets simultaneously.

The following steps, as disclosed in the invention, are
performed for selection of those subsets:

• The melting temperature of any combination of two of
  those primer molecules hybridizing to each other and
  taking part in one multiplex experiment must be below
  20°C.

• The last seven nucleotides from the 3' end of every
  primer molecule in a subset are used to check whether
  they are complementary and/or binding to any other
  primer molecule's sequence used in the set.

• The number of mismatch bases allowed for when
  virtually testing the primer pairs for amplification of
  an unwanted product is 2. For this step every possible
  combination of the 16 primer molecules in one subset
  is checked for its ability to amplify an unwanted
  product. This is done by means of e-PCR (electronic
  PCR).


Having performed all these steps results in the selection
of three different optimized sets of primer molecule
pairs that can be used in multiplex PCRs. These sets are
described in the following as sets of numbers. Each number
refers to a specific amplificate and therefore also to a
single primer pair (out of the list given above) which
proved to be able to specifically amplify said nucleic
acid in a single PCR experiment.

TABLE 3:

optimized set 1
8plex1   2194  2191  2391  2025  2961  3540  2861  2188
8plex2   2484  2106  2401  2850  3532  2044  2512  2852
8plex3   2453  2741  2867  2755  2267  2387  2864  2317
8plex4   2859  2383  2752  2747  2751  3511  2212  2746
8plex5   3534  2395  2745  3538  2749  2166  2831  2045

optimized set 2
8plex1   2166  2212  3511  2383  2745  2859  3534  2861
8plex2   2749  2191  2751  2395  2961  2512  2831  3538
8plex3   2850  2025  2188  2317  2391  2852  3540  2194
8plex4   2106  2387  2867  2864  2401  2747  2746  2453
8plex5   2044  2484  2267  2755  2752  2741  2045  3532

optimized set 3
8plex1   2194  2391  2191  2749  2745  3538  2861  2961
8plex2   2166  2188  2859  2212  2864  2746  2383  2752
8plex3   2484  2401  2850  2852  2512  2755  2106  2044
8plex4   2867  2453  3532  2025  2741  2267  2317  2387
8plex5   3511  3534  2751  2747  2395  3540  2831  2045



Without the use of said invention, the selection would
have been performed randomly and tested for successful
application later. Three randomly chosen subsets are
shown here.

TABLE 4:

random set 1
8plex1   2191  2194  2267  2741  3534  3511  2749  2747
8plex2   2391  2484  2867  2852  2453  2512  2025  3538
8plex3   2746  2212  2755  2045  2044  2188  2961  2864
8plex4   2831  2383  3540  2859  2861  2395  2401  2317
8plex5   2106  2751  2387  2745  2752  3532  2850  2166

random set 2
8plex1   2045  2106  2212  2745  2044  2749  2752  2391
8plex2   2025  2831  2401  3540  2395  2484  2453  2961
8plex3   2194  2859  2746  2512  2267  2864  2861  2751
8plex4   2383  2166  2747  2387  3532  2741  2867  2852
8plex5   3534  2755  2850  2317  2191  3538  3511  2188

random set 3
8plex1   2484  2850  2741  2747  2755  2745  2025  2746
8plex2   2383  3534  2861  2751  2749  2391  2188  2191
8plex3   2194  3538  2512  2961  2864  2867  2831  3532
8plex4   3511  2045  2387  2212  2166  2267  3540  2401
8plex5   2395  2317  2859  2453  2852  2106  2752  2044



The sequences of all of those amplificates and the
corresponding primers are given in the sequence protocol
(primers SEQ IDs 81-160; amplificates SEQ IDs 1-40). SEQ
IDs refer to the internal numbers used in these tables as
is shown in TABLES 1 and 2.

To show whether the use of the design method described
herein is superior to the common method of randomly
selecting primers for simultaneous amplification, said
multiplex PCRs were performed. This example hereby
demonstrates the advantage of the method which is the
subject of the invention:

A total of 40 amplificates (with lengths ranging from 187
to 499 bp) were partitioned into five 8-plex PCRs using
either of two strategies.

First: the grouping was based on the invention using said
"optimized sets" ("designed group").

Second: the grouping was done without using the selection
criteria established by this invention, using the "random
sets" ("control group").

Whether such grouping can improve the success rate of
mPCRs was subsequently tested experimentally by comparing
the number of true positives, false positives and false
negatives for each of the two classes.

Each of the five mPCRs (multiplex PCRs) contained 8
primer pairs specific for 8 amplificates, with one primer
of each pair being labeled with a Cy-5 fluorescent tag.
Only fragments that performed successfully in sPCR
(singleplex PCR) using bisulfite-modified human DNA from
whole blood were included in this study. Isomolar primer
concentrations were used in a 20 µl PCR reaction volume
and cycling was done for 42 cycles using a 96-well
microtiter plate thermocycler.

Group assignments for the "optimized" and "random" groups
were done in triplicate and all mPCRs were run at the
same time so as to minimize experimental variation in
PCR performance.

A mixture of the amplificates that were expected to be
generated in a specific mPCR reaction but were generated
in eight corresponding sPCR reactions was called
sPCR-pool. Electrophoresis of sPCR-pool amplificates and
mPCR amplificates was done simultaneously using the
ALFexpress system (Amersham Pharmacia). In order to obtain
the best comparability for mPCRs with their respective
sPCR standard, these products were electrophoresed next to
each other on the gels.

Figures 1 and 2 show examples of these results as
electropherograms, given as ALFexpress output files.

Success or failure scoring for each mPCR was based on
assessing the number of generated or absent fragments
compared to their respective pool of sPCR fragments. Only
fragments with peak areas equal to or larger than 8% of the
largest peak within one electropherogram were included
in the analysis.
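
The scoring rule can be made concrete with the following
sketch. It assumes that each electropherogram is available
as a list of (fragment size in bp, peak area) tuples and
that two peaks count as the same fragment when their sizes
differ by no more than a tolerance; the tolerance of 2 bp,
the data layout and the function names are assumptions of
this example and not part of the original analysis.

```python
def significant_peaks(peaks, rel_threshold=0.08):
    """Sizes of peaks whose area is at least 8% of the largest peak."""
    if not peaks:
        return []
    cutoff = rel_threshold * max(area for _, area in peaks)
    return [size for size, area in peaks if area >= cutoff]


def score_mpcr(mpcr_peaks, spcr_pool_peaks, tolerance_bp=2.0):
    """Count true positives, false positives and false negatives."""
    observed = significant_peaks(mpcr_peaks)
    expected = significant_peaks(spcr_pool_peaks)
    used = set()  # indices of mPCR peaks already matched
    tp = 0
    for e in expected:
        match = next((i for i, o in enumerate(observed)
                      if i not in used and abs(o - e) <= tolerance_bp), None)
        if match is not None:
            used.add(match)
            tp += 1
    return {"TP": tp,
            "FP": len(observed) - len(used),   # mPCR peaks with no sPCR partner
            "FN": len(expected) - tp}          # sPCR fragments missing in mPCR
```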

Figure 1 illustrates a result of an 8-plex PCR based on a
primer combination from the "optimized set". The top
graph in the figure shows peaks of size standards only.
The second graph in the figure shows the electrophoresed
mixture of the products from 8 singleplex PCRs. The third
graph shows the products resulting from a multiplex PCR
employing one of the optimized sets of primer
combinations. By comparing these graphs it becomes visible
that, in this specific example, there is only one false
negative (FN) and three false positives (FP), whereas there
are eight true positives (TP).

Figure 2, however, illustrates a result of an 8-plex PCR
based on a primer combination from the "control set". The
top graph in the figure shows peaks of size standards
only. The second graph in the figure shows the
electrophoresed mixture of the products from 8 singleplex
PCRs. The third graph shows the products resulting from a
multiplex PCR employing one of the randomly chosen sets, as
is the state of the art. This graph clearly shows that
there are eight false negative and six false positive
peaks, whereas there is only one true positive. Hence,
for this specific example we have demonstrated the
superiority of the design method.

A more comprehensive view of the results is given in
Figures 3 and 4.

By applying the Wilcoxon rank sum test to the numbers of
false negatives, false positives and true positives as
follows, it becomes evident that the optimized set resulted
in a more reliable amplification experiment:

data: False negatives (FN)
p-value = 0.02602; rejection of null hypothesis
null hypothesis (H0): true if median of designed set is
equal to or greater than that of control set
alternative hypothesis (H1): true if median of designed set
is less than that of control set

data: False positives (FP)
p-value = 0.06711; rejection of null hypothesis
null hypothesis (H0): true if median of designed set is
equal to or less than that of control set
alternative hypothesis (H1): true if median of designed set
is greater than that of control set

data: True positives (TP)
p-value = 0.02146; rejection of null hypothesis
null hypothesis (H0): true if median of designed set is
equal to or less than that of control set
alternative hypothesis (H1): true if median of designed set
is greater than that of control set
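
For readers who want to reproduce a test of this kind, a
one-sided Wilcoxon rank sum test can be sketched as follows.
The sketch uses mid-ranks for ties and a tie-corrected
normal approximation without continuity correction, so its
p-values may differ slightly from those of the statistics
software used for the example. The function name is
hypothetical and no data from the experiment is reproduced.

```python
import math


def rank_sum_test_less(x, y):
    """One-sided rank sum test; H1: values in x tend to be smaller than in y.

    Returns (U statistic of x, approximate p-value).
    """
    pooled = sorted((v, g) for g, vals in ((0, x), (1, y)) for v in vals)
    n1, n2 = len(x), len(y)
    n = n1 + n2
    rank_sum_x = 0.0
    tie_term = 0.0
    i = 0
    while i < n:
        j = i
        while j < n and pooled[j][0] == pooled[i][0]:
            j += 1
        mid_rank = (i + 1 + j) / 2.0          # average of ranks i+1 .. j
        t = j - i                             # size of this tie group
        tie_term += t ** 3 - t
        rank_sum_x += mid_rank * sum(1 for k in range(i, j)
                                     if pooled[k][1] == 0)
        i = j
    u = rank_sum_x - n1 * (n1 + 1) / 2.0
    mean = n1 * n2 / 2.0
    var = n1 * n2 / 12.0 * ((n + 1) - tie_term / (n * (n - 1)))
    if var == 0:
        return u, 1.0
    z = (u - mean) / math.sqrt(var)
    p = 0.5 * math.erfc(-z / math.sqrt(2))    # P(Z <= z)
    return u, p
```

To test whether the designed set yields more true positives
than the control set, the two samples would simply be
swapped, since "x greater than y" is the same statement as
"y less than x".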

Figure 3 illustrates a summary of several such compari- 
35 sons (as described in detail above) . Six diagrams are 
shown, that illustrate the numbers of false positives 
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(FP) , false negatives (FN) and true positives (TP) for a 
number of 18 experiments. In the top row of figure 3 the 
results for experiments that employed the design method 
are shown whereas in the lower row results from experi- 
5 ments are shown, that did use the conventional method of 
random selection. 

At the x-axis the occurrence of an event (like a false 
positive) per 8plex is given whereas the values of the y- 
10 axis indicate the frequency of an event like this occur- 
ring within the number of experiments performed. - 

For example, in the diagram titled FN, a y-value of 0 indicates
that the given count did not occur in a single experiment; a
y-value of four indicates that the number of occurrences given
as the x-value was found in four experiments (out of the 18
experiments considered for these analyses). The x-value
indicates how many occurrences are counted; an x-value of three
in this diagram indicates the occurrence of three false
negatives. A data point with an x-value of 0 and a y-value of 9
means that, in the set of mPCR results considered, nine
experiments showed 0 false negatives.
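How such a diagram is read can be sketched as a simple tally: for each possible number of events per 8plex (x-value), count the experiments in which exactly that number occurred (y-value). The function name and the 18 counts below are hypothetical and chosen only to echo the two data points mentioned above; they are not the data of Figure 3.

```python
from collections import Counter


def event_histogram(counts_per_experiment, max_events=8):
    """Return a list whose index x holds the number of experiments
    that showed exactly x events (an 8plex allows 0 to 8 events)."""
    tally = Counter(counts_per_experiment)
    return [tally.get(x, 0) for x in range(max_events + 1)]


# Hypothetical false negative counts for 18 experiments (illustration only).
fn_counts = [0] * 9 + [1] * 3 + [2] * 2 + [3] * 4
hist = event_histogram(fn_counts)
for x, y in enumerate(hist):
    print("x =", x, "false negatives per 8plex -> y =", y, "experiments")
```

With these assumed counts, the entry at x = 0 is 9 and the entry at x = 3 is 4, matching the reading described in the text.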

Figure 4 gives all of the data from the 18 multiplex PCR
experiments of this example in one table. The letter A, heading
the four columns on the left side, indicates the results from
multiplex PCRs of the designed group using the five optimized
sets of primer pairs that were designed and selected according
to the invention. The letter C indicates the results from
multiplex PCRs of the control group using the five randomized
sets of primer pairs.



The first column lists the identifying numbers of the
experiments, the second column gives the number of true
positives (TP) within each experiment, the third column gives
the number of false positives (FP) and the last column gives
the number of false negatives (FN).

The average false negative rate (Ø FN) of the optimized group
is significantly lower than that of the control group.
Correspondingly, the average true positive rate (Ø TP) is
significantly higher. The average false positive rates (Ø FP)
of the two groups do not differ significantly from each other.

This is due to the high deviation of false positives observed
between individual ALFexpress analysis runs. The 36 sets of
amplificates were analyzed on two separate gel runs. These runs
were not designed simply to duplicate the results, but could be
used to analyze whether the average TP, FP and FN rates are
similar, independent of the run and of the sets chosen. Only
three of those sets were duplicated, as indicated by the
letters a and b for sets 11, 21 and 23. It turned out that the
rate of true positives as well as the rate of false negatives,
averaged over 18 sets per run, were highly reproducible: 6.83
versus 7.33 and 1.44 versus 1.39, respectively. However, the
rate of false positives was determined as 4.11 in the first run
and 7.61 in the second run.
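The run-to-run comparison above can be illustrated with a short arithmetic sketch. It takes the six reported averages, computes the relative change between the two runs, and back-calculates the event totals implied by averaging over 18 sets. These totals are an inference from the rounded averages and are not stated in the text; the names used are hypothetical.

```python
SETS_PER_RUN = 18

# Averages per 8plex as reported in the text: (first run, second run).
REPORTED_MEANS = {
    "TP": (6.83, 7.33),
    "FN": (1.44, 1.39),
    "FP": (4.11, 7.61),
}


def implied_total(mean, n=SETS_PER_RUN):
    """Whole-number event total that would give `mean` over n sets
    (an inference from the rounded average, not a reported value)."""
    return round(mean * n)


def relative_change(run1, run2):
    """Change of the second run relative to the first run."""
    return (run2 - run1) / run1


for label, (run1, run2) in REPORTED_MEANS.items():
    print(label,
          "implied totals:", implied_total(run1), implied_total(run2),
          "relative change: {:+.1%}".format(relative_change(run1, run2)))
```

Under these assumptions the TP and FN averages differ by only a few percent between runs, whereas the FP average nearly doubles, which is the deviation the paragraph refers to.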

Taken together, it could be concluded that the overall success
rate of amplifying 40 fragments within 5 groups of 8plex PCRs
was significantly increased when the primer grouping was based
on the method that is the subject of this invention, compared
to an arbitrary primer grouping. The improved success rate of
only 11% failures versus 24% in the random control group
clearly becomes relevant when much larger numbers of mPCRs have
to be established, as is the case in a high-throughput
laboratory.



Sequence listing 

<110> Epigenomics AG 

<120> Method for amplification of nucleic acids of low complexity 

<160> 160 

<210> 1 
<211> 322 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 2025 
<400> 1 



aatcctccaa attctaaaaa cataaaaata acgcaaccca aaaacaaaaa acccctccgc 60
ccattaatta ctatacacta acgaaacttt cccgacccac aacgacgaaa ataaaaacaa 120
tcgctaacgc taaaaaacat caaaaacact acccaaccca aatatcgccg ccgcttccac 180
aaaactctac taaacgccgc cgccgccgct accaccgcct ctaatccaaa ccacctcccg 240
ccaaataaac cccgaaatcc taactcaaat atatatctct ccctccctct ccctccattc 300
gtcattttct cactcccttt cc 322



<210> 2 
<211> 413 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 2044 
<400> 2 



ggataggagt tgggattaag attttcggtt agtttcgtat tttttcgtat tttttagtat 60
cgtttcgtat ttttcgtatt ttttttcggg ttattacgtt ttttatgtga ttcgtttggg 120
taacgtcgaa tttagtcgcg tagcgttgta gtgaattttt tttttaaatt gtaataagtc 180
gttttttaag gtaattacgt tttttttgtt ttttttttaa aaaataaaaa taaaaaattt 240
atagaaaaaa attcgcgagt ttagaaaaaa gaagtaattg gtagaaggtt ttaattaagg 300
taaagagttg taaggcgaag ttaagaaaat gtaggtattt aaaaaatgta ggtaattttt 360
ataagggttt ttggggagag gtatatagag ggattttggt gttgaaaaag att 413

<210> 3
<211> 347
<212> DNA
<213> Artificial Sequence
<220>
<223> 2045
<400> 3

aaccctttct tcaaattaca aaccttctta ccttcaaacc tcgactccaa caccaatccg 60
acaaaaaaac ccaatctaat aaaatacgct cccttcctac cattctctat tccattaacc 120
tatttcgtaa taaacgtaaa actaatcctc caaaattacc ttattaatta acttacatat 180
ttattatcta tctatcccac caaaatacaa atttccgaaa aacaaaaatt taaaaaaatc 240
tattttattc tatataattt tcccatacca aacaccgtac ccgacacaaa ctaaaatccc 300
aatacacatc tcgaaacgaa aaaaccgtat ttccctaaaa cccaatc 347

<210> 4
<211> 283
<212> DNA
<213> Artificial Sequence
<220>
<223> 2106
<400> 4

ttgaaaataa gaaaggttga ggtagagagg ataatatagt tttagtttat tttttagtat 60
tttgttaatt ttttttaatt tttagttata aattcgagat ataacgtttt ttttttaaag 120
aggtcgcgtt ttttttgtgg tggtttttag ggattcgttt tagttttttt ttcgttttta 180
gttttatata ttgggattat taggtattta agattttatt ttttaggtgg tatttttagc 240
gtaggttgtt atttagtttt tttttaggga tttggggtag aag 283



<210> 5
<211> 211
<212> DNA
<213> Artificial Sequence
<220>
<223> 2166
<400> 5

tgtttgggat tgggtagggt tatcggggtt gggggggcgg ggtttgtggg taaggcgggc 60
ggaggcgtgg atttttcgtt cgatgatagg gttggaggag gaaggggcgg gttgaagaag 120
gggaaggtgg gaagagttta gtcggggtta taaattgggt gaagcgttga ggttttagta 180
ttttcgtttg aggagatagg taaaggttat g 211



<210> 6
<211> 497
<212> DNA
<213> Artificial Sequence
<220>
<223> 2188
<400> 6

ttttagattg aggttttagg gttaaaggat tatttttttt tttagcgttg gttcgggaaa 60
ggtaagtttc gggcgggagc gtacgtcgcg ttttcgaagt ttggtttttt cgttacgttt 120
attttttgtt tttatttcgc gtttttttag gttttttttc ggtgaatcgg atgttttgtt 180
agttttttat tttgcgtttt cggtcgcggt tcgggttttt cgtaaagtcg ttgttatttc 240
ggagggttta gttagcgggt tttcggaggt tggtcgggta ggcgtggtgc gcggtaggag 300
ttgggcgcgt acggttatcg cgcgtggagg agatattgtt ttgtcgcgat gggggttcgg 360
ggcgtttttt tacgtcgtag gtaagcgggg cggcggttgc ggtatttgtt tatcgggagt 420
tttttttttt tttttttgtt gttgttgttt tgtatttagt tcgggggagg atagaagaaa 480
aaggaggtag aatggat 497

<210> 7
<211> 373
<212> DNA
<213> Artificial Sequence
<220>
<223> 2191
<400> 7

ggaggggaga gggttatgcg attttatttt tggttagggt cggggaggtt tttgtttttc 60
gggagttttg ttcgggtttt ttggtcgtag ggttgttggg ttttaggtag gaacgagagg 120
gtgaggttta tatgtggttc ggcggtttag ggcggtttgt agcgttttta ttgtttcggt 180
tgttaggggt tgcggcgacg cggttagtta gtagcgagtt taggtcgcgt agattttatt 240
gatgagtttt gatttttagt atttttttta agttaagaag agtttagcgt atttttcggt 300
tgttttattt tagttttttt gttttagttt tttagtttta ttttttttcg ttttgttttg 360
gggtgtgtat agt 373



<210> 8
<211> 368
<212> DNA
<213> Artificial Sequence
<220>
<223> 2194
<400> 8

ttttgggaat gggttgtatc gagaggttcg attagtttta gggttttagt gagggggtag 60
tggaatttag cgagggattg agagttttat agtatgtacg agtttgatgt tagagaaaaa 120
gtcgggagat aaaggagtcg cgtgttatta aattgtcgtc gtagtcgtag ttatttaagt 180
gtcggatttg tgagtatttt gcgtttttag ttttcggata gaagttggag aatttttttg 240
gagaattttt cgagttagga gacgagattt tttaataatt attatttttt tttgcgtttt 300
ttatttgtcg ttcgttggga taaacgatag ttatagtttt tttgacgata ggatggaggt 360
taagggta 368

<210> 9
<211> 352
<212> DNA
<213> Artificial Sequence
<220>
<223> 2212
<400> 9

ttgttgggag tttttaagtt ttgtgagaat tttgggagtt ggtgatgtta gattagttgg 60
gttatttgaa ggttagtagt tcgggtaggg tttatcgaaa gtttattcgt atatattagg 120
taatttaatt ttttattttg tgtgatagaa gtagtaggaa gtgagttgtt tagaggtagg 180
agggtttatt ttttgttaaa ggggggatta gaattttttt atgcgagttg tttgaggatt 240
gggatgtcga gaacgcgagc gattcgagta gggtttgttt gggtatcgtc ggggtaggat 300
tcggaacgta ttcggaaggt tttttgtaag tatttatttg gaaggagaat tt 352



<210> 10
<211> 295
<212> DNA
<213> Artificial Sequence
<220>
<223> 2267
<400> 10

gtaatttgaa gaaagttgag gggaggcggt agatgttttg atttattagg gaaaacgtgg 60
acgttttttg ttgttatttt gtgaattgtg tgtatttagt tatttttgag taaatatttg 120
gagcgaggaa tttttgagtg gtgtgggagg gcggtgaggg gtagttgaaa gtcggttaaa 180
gttttcggag gggttggttt aggaaatatg attggtagtt acgagagagt taggggttgg 240
acgtcgagga gagggagaag gttttcgggc ggagagaggt tttgtttagt tgttg 295



1386-WO

<210> 11
<211> 278
<212> DNA
<213> Artificial Sequence
<220>
<223> 2317
<400> 11

ggagttgtat tgttgggaga tttgggtgta gatgatgggg atgttaggat tattcgaatt 60
taaagttgaa cgtttaggta gaggagtgga gttttgggga attttgagtc ggtttaaagc 120
gtattttttt gtatatttat tcggtgttgg gcgtagggaa tttttgaaat aaaagatgta 180
taaagtattg aggtttgaga tttttggatt tcgaaatatt gagaatttat agttgtatat 240
tttagagttt atggtatttt agtgaaaatt ggggtttt 278



<210> 12
<211> 285
<212> DNA
<213> Artificial Sequence
<220>
<223> 2383
<400> 12

tttgtattag gttggaagtg gtcgttagtt tttcgtgtaa ttttattttt tggaaaagtg 60
gaattagttg gtattgttta gcgtgatttg tgaggttgag ttttaatagt ttaaagaagt 120
aaatgggatg ttattttcgc ggggttcgtt tttcgcgagg tgtttatttc gtatttgtta 180
tgtaaaacga gggagcgtta ggaaggaatt cgttttgtaa agttattggt tttggttatt 240
agtttttatt taatgttttc gtgatgttgt tgttgattta tttgg 285



<210> 13
<211> 380
<212> DNA
<213> Artificial Sequence
<220>
<223> 2387
<400> 13

gatttttgga gaggaagtta agtgtttttt tgtttttttt cggtatttta tttaaggcga 60
ttagtttaga attggttttc ggaagcgttc gggtaaagat tgcgaagaag aaaagatatt 120
tggcggaaat ttgtgcgttt ggggcggtgg aattcgggga ggagagggag ggattagata 180
ggagagtggg gattattttt tttgttttta aattggggta gttttttggg ttttcgattt 240
ttttattttc gtgggtaaaa aattttgttt ttatcgggtt tacgtaattt ttttaagggg 300
agaggaggga aaaatttgtg gggggtacga aaaggcggaa agaaatagtt atttcgttat 360
atgggtttgg tttttagttt 380



<210> 14
<211> 397
<212> DNA
<213> Artificial Sequence
<220>
<223> 2391
<400> 14

tggggttagt ttaggatagg cgttcggggg acgcgtgttt ttattttacg gggacggtgg 60
aggagagtta gcgagggttc gaggggtagg tattttaacg aatggttttt ttggtgtttt 120
ttgcgtttcg tcggtttatt ttttttttta taaaacgggt ttagttttta gtatttattt 180
ttcgttatta attaggtatt tcgggagatt agttcgttcg aaagtttttg cgttatttcg 240
cgggtttttt taggtggttt ttttagtttc gttttttttc gggatgtttg ttgattattt 300
cgagttcgcg tggcgtaaga gtacgagcgt cgagttcgtg cgcgttaagg ttgcgtgggc 360
gggtatcgat ttttttgaga agttttagtg tttttaa 397



<210> 15
<211> 547
<212> DNA
<213> Artificial Sequence
<220>
<223> 2395
<400> 15

tttttgtatt ggggtaggtt tcggtaggtg tatgggagga agtacggaga atttataagt 60
ttttcgattt tttagtttag acgttgttgg gtttttttcg ttggagatcg cgtttttttt 120
aaatttttgt gagcgttgcg gaagtacgcg gggttcgggt cgttgagcgt tgtaagatag 180
gggagggagt cgggcgggag agggaggggc ggcgtcgggg cgggttttga tatagagtag 240
gcgtcgcggg tcgtagtata gtcggagatc gtagttcgga gttcgggtta gggtttattt 300
gttttcgtag cgtcggttcg cgtttttttg tcgtagttat cggtgagtgt cgcggttttg 360
agattttcgg gtcggatgcg cggcggtttt agttttcgag cgtttgtttg tttcgttttg 420
ggttgttcgg gttttttggg tttttcggcg gttgtacgga gttaaggcgt ttcgtttcgg 480
gcgtttttcg cgggtgtcga tttaggttgt tcggagttcg gagtttatag aggagagaga 540
tagttgg 547



<210> 16
<211> 414
<212> DNA
<213> Artificial Sequence
<220>
<223> 2401
<400> 16

attagaagtg aaagtaatgg aatttcgatg taaatataat attatttttt tgtagagtta 60
ttttgagtat aataaatttg aattgtgtta atgttgggag aaaaaattta aaagaagaac 120
ggagcgaata gtagtttttt cgttcgttga ttagaaatag taggacgata ttttttcgat 180
tggaggagag cgtttgcgtt cgtatttagt tggcgttcgt ttttttgttt tttttttagt 240
cgtttttttt tttttttttc gcgttttagt tattcgggaa ggtattgcgg tagttgggtt 300
ttgattggtt gttttgaaag tttacgggtt attcgattgg tgaattcggg gttttttagc 360
gcggtgagtt tgaaattgtt cgtatttggt tttaaagttg gtttttggaa attg 414



<210> 17
<211> 272
<212> DNA
<213> Artificial Sequence
<220>
<223> 2453
<400> 17

gggatgggtt attagttgta aatcgtggaa ttttttttga tataatgaaa agatgagggt 60
gtataagttt tttagtaggg tgatgatata aaaagttatc ggagtatttt ataaggtata 120
aatttttaga gatagtagag tatataagtt tttaggataa gagttaggaa gaaattatcg 180
gaaggaatta ttttattgtg tgtaaatatg atttttaagt tggtcgtggt ttttttggta 240
gtttttttga tttttgtagt tttgtgtgaa gg 272

<210> 18
<211> 391
<212> DNA
<213> Artificial Sequence
<220>
<223> 2484
<400> 18

taattgaagg ggttaatagt ggaatttggt tgggtgtttg ttaaattttt ttttttggtt 60
ttgttttggg tttttttttg aagggatttt ttttcgtttt tgtaataaga ttttttataa 120
agtatagatt ttttatttta tttcgcggta tttgtatcgg gttttattgg ttttaggagt 180
tgaatatttt tttaggtata tataggtggg atataaataa gggttttgga attattattt 240
ttttattacg atagtaattt aaaatgtttg ggaagatggt cgtgattttt ggagttttaa 300
atatattttg gataatgttt gtagtttgta agttattttt ttttatttgt tttaaatgtt 360
agtatttaat tttagttttg gttttggttt t 391



<210> 19
<211> 430
<212> DNA
<213> Artificial Sequence
<220>
<223> 2512
<400> 19

agtggatttg gagtttagat gtaatataat gattgatatt ggtatagtat atttattttg 60
tttttgtaaa taaaatggta tatgtgatgt ttttttttgt ttttttgtat ataaaataat 120
atttgttttt atttattatg tatttatgtt tttattttgt atgttaggag ttaagtattt 180
tgtatgtatt aatttatttt gtttttataa taatttttat atgtaggaat tattatagtt 240
attttatgaa tgagtcgagg aaggtattga gacgttaagt aatttgttta aggttacgta 300
gttagtaagt ggtagagtaa gaattattat ggttttataa gtttaggaaa aagtttgaaa 360
gaattaaaat gttaatagcg gggattttaa ggaagtattg aagaggttat gggagaagtt 420
tttattttgt 430



<210> 20
<211> 475
<212> DNA
<213> Artificial Sequence
<220>
<223> 2741
<400> 20

taggggaaaa gttagagttg agaggttggg gcgcgacgag tttggatatc gggcggggat 60
ttaagttttt ttcgtttagt taataattgt gtttttttta ggaaggcgtg aggaaatgtt 120
ttaattaatt tttgtatttt ttttttggaa tttgggttgt atttttttat ttattgtaaa 180
ttttataatt tatttagggg tttttttagt gtttgttttt agcggtttcg gtgtttattt 240
attagtgttg tttttttttt ttcgtaagat tgcgttttag ttttagtttt ttttttcgcg 300
ggtgtttttt aaatcgtttt attattttcg ggtttaggga ggcggaatcg tgtttgtttt 360
tcggtttttt taagaggcgt cggttttatt ttttttagag tcgcggtttg acgcgagatg 420
atagtaacga gttcggtatg tttatgtaaa taagcgtttt tttgtgggtt aatgg 475



<210> 21
<211> 412
<212> DNA
<213> Artificial Sequence
<220>
<223> 2745
<400> 21

attttagttt gtgaaatggg atttaggatt taggtagagg tgcgttttcg gtttggggat 60
cgagtatttt gtgcgtttcg gtaacgtagg aagatagcgt tattgatatt ttagagatta 120
gcgggtatcg tttggaggcg tttttattat ttggcggttt cgggttcgcg ttttatcgcg 180
ttataagatt tacgttcgaa ttacgtgatt agggtcgtgg tttcgtttcg ttttcgcgtc 240
gcgcgtcgtt ttcggtaggg gcggaaagcg gaagtgtggg agggtttgcg gggcgggttt 300
aggaggttcg cgggaggatg gagtagtgag cgggtttggg cggttgttgg tagcgttatg 360
gagacggtat agttgaggaa ttcgtcgcgt cggtgagggg ttattggtta ag 412

<210> 22
<211> 484
<212> DNA
<213> Artificial Sequence
<220>
<223> 2746
<400> 22

gtgggttttg ggtagttata gaagttatcg cgttggcggg gaggaggggg atcgatgcgg 60
tttatgtttc gggtagtttt attttttttg tttgcgaagg gtttttgttc ggcgggagga 120
gagaggcgcg ttttattcgg gtttttttat atttgtcgtc gtttgggtcg atttcgcggg 180
tttcgttcgg cgttttagtc gattttcgtt tagtttcggg tttatgggcg cggttagtag 240
ggcgggttag ggcggcgggg cgcgatattg ggaggaagtg cgggtcgttt gttcgggcgc 300
gttaaggaag ttgtttaaaa tgaggaagag tcgcgggttc ggcggttgag gttatttcgg 360
cggcggttgg agagcgagga ggagcgggtg gtttcgcgtt gcgttcgttt tcgttttatt 420
tggcgtaggt aggtgtggtc gcgtttttta ttcggtcggg attttttggt aaggagagga 480
ggtt 484

<210> 23
<211> 476
<212> DNA
<213> Artificial Sequence
<220>
<223> 2747
<400> 23

taggatgggg agagtaatgt tttcgagtag aatagggtgg ggtttttaga ttattttttt 60
ttttttatag ttggttttat tttatcgatt ttattaaagt ttttttggga gtattttaga 120
gaagagttac gtttaggtcg ggttttggtt gtttggttta cggcggaatt tttagtatta 180
cgtttcgtac gtcgggttta aagtatgttt agtgaaggag taggtattta ttgttagatg 240
gagttatttt tttagatttg gggttttttt ataacgatgg ttatgtttgg tatggaagtt 300
tttttagaag ttaatagtag gaaataaggg ttaatagtat ttaattgtgg agtaaggttt 360
aaattttagt tttgttattt aatcgtttcg aatttgtttt tttattgtag aggcgaaaag 420
gttaatatta ttttatttcg gagggttatc gtggagaatg gaagttggat aagttg 476


<210> 24
<211> 419
<212> DNA
<213> Artificial Sequence
<220>
<223> 2749
<400> 24

tcccacaaaa actaaacaat tattacaaat tcaaaaaacc ccgaccaatt tttcaaaaat 60
ttctcctcct cttttccccc taaaactcgt aatactttta ctctactttc aaaatacatt 120
aaatctccta ctttataact actttaaaac caacaaatac tctaatatat ataattcaaa 180
ttatacaaat ttcacgaata aatttaatct tattttttaa attaattaaa aaacaaataa 240
tatttaaaaa aatattaact tataattatt tcaccctttt tactttaaac atttttatta 300
cttctcgacc ttttaactaa aatcaaatat atactttaaa cattttttaa aataaaaata 360
tccttttaat ttaataaaaa aacaaaattc tacataaaaa aaccccttca tctaaaacc 419

<210> 25
<211> 479
<212> DNA
<213> Artificial Sequence
<220>
<223> 2751
<400> 25

tttggagggt ttagtagaag ttattttagg ggagggttcg ataggaagga aggtaggttt 60
gtcggagggg tatataggag tttttttttt cgttatagtg tttagggtta attgttttag 120
tttttaggtt gggttaatag gatgggatag tttaggcgga aggaaatttg tggggaggga 180
tatttcgtag atagaagtag ggatatgggg tggggagagg taggaagagt tgtcgggttg 240
ttgagttggc gtttttttag tagatttagg aggggcggtg ataggaggtt attttttttt 300
tattttcgta gttttgggtt tttttggttt tggttaatag tattattatt attattattg 360
ttgttgttcg ttagtttggg ttttagatat attagaaaaa aattatcgga agatacgtat 420
agtattggta gtttttaaaa gaattaattt tttttttgtg tttattttgt gattattgg 479

<210> 26
<211> 484
<212> DNA
<213> Artificial Sequence
<220>
<223> 2752
<400> 26

atacaacctc aaatcctatc caaaccccca aaacatcaca ctcgaaactt attctacata 60
tttttacttt tacctcccac taatactaat tcttccgtaa aacaacctaa atcccttcaa 120
atacttaata ttttttctca aatactacca taaaaccaaa tctccaccgt cttaaaacat 180
tcctttttaa aaataaaaaa tatatatcgc tccttttata taatttacat tctatcttaa 240
ataatttaac catcaccgta attcattcaa atctatttaa atcctaccca tctcaacttc 300
aatccatttc attcttttaa atctaatcga caattacctc caacaacttc atcacaaatc 360
actcacaaaa ataaccttaa tcctaaaatt tatttacgaa aaacacactt actaaatata 420
taacaaatat acaaaaaaca caaaataaaa caacaaatct aaaaacaaat aacttccttc 480
tccc 484

<210> 27
<211> 371
<212> DNA
<213> Artificial Sequence
<220>
<223> 2755
<400> 27

ggaagatgag gaagttgatt agatattaag gatgagcgga tgatttaata ggtttttttg 60
ttaagatttg gttgggtagg tgaaagataa agtcgaggag tggttatggt gtggtataga 120
agaagggtta gaggacggtt tttgttattt ttttatgttt gagttttttt ttttgtgaaa 180
tggggataat aagagtcgtt atatagggaa ttgttgttag gattaaatga gataatgtat 240
gtgaaacgtt ttggttgtag gttttttagt aaatgggtac gatttgcgga gtggggattt 300
gaatttacgt ttggcgggat gtttaagttg ttattttgat cgttagggag ttttagagga 360
tagggttgta g 371



<210> 28
<211> 186
<212> DNA
<213> Artificial Sequence
<220>
<223> 2831
<400> 28

ttagtagggg tgtgagtgtt ttgattagaa ttattttttt ttgttagaat ttgatgtaat 60
tcgaatgttt ttatttttgt ttgaagggtt taaataataa attaggtttt gtcgtgttat 120
tatgggggtg gttatatttt gtatttagga aataggtacg gtagggttga gatagaagtt 180
ttgttt 186

<210> 29
<211> 300
<212> DNA
<213> Artificial Sequence
<220>
<223> 2850
<400> 29

ttatagggtt gagtttggga tcgaggtgag agtcgtcggg ttgggagtga gggagatggg 60
aataaggtcg tcggtgggcg aggggagtcg agggaattcg ggggattggg aggtttgggg 120
cggcgcggtt tggtcgggtt gggatcggtt tttcggttta gacgttcgcg atgttggtat 180
tttttgttat tttttatttg ggttttaggg gttcgttttt gggtagtttg gagtttttcg 240
aggtgggagg atcgggcgga ggtggaggaa gttttttttt ggaagatttg ttgtttgttt 300

<210> 30
<211> 321
<212> DNA
<213> Artificial Sequence
<220>
<223> 2852
<400> 30

tgaaaatgaa ggtatggagt ttggtgttaa aagaaatttt ttttaaaaat taaataataa 60
tattagagta aagtttttag ggcgagataa ggagttgtaa taaaataagc ggaaattcga 120
gaagcgttaa tgttttaaag ggttaatgat tatatataat ttacgtagtt aacgtgttaa 180
aatatattaa cgtatttttt ttttttaaat aaagtaggaa agcggatttt gtatgagggg 240
cgggttgtcg atttagtagt ttttttcgga tagttcgttt tgattttttt tggttggtcg 300
tggagggatt atatggtttt a 321



<210> 31
<211> 398
<212> DNA
<213> Artificial Sequence
<220>
<223> 2859
<400> 31

tatgtttggt tttgttttga gatagagttt cgttttgtcg tttaggttgg ttaaaagata 60
gggttttagt cgggtgcggt ggtttacgtt tgtaatttta gtattttggg aggtcgaggc 120
gggcggatta tttgaggttc ggagttcgag attagtttgg gttaatatgg cgaaacgttg 180
tttttattaa aaataataaa aattatttag gcgtggtggc gcgtatttgt aattttagtt 240
attcgggagg ttgaggtagg agaattattt gaatttagga ggtagacgtt gtagtgagtc 300
gagatcgcgt tattgtattt tagtttgggc gatagaggga gatttcgttt taaaaaaagg 360
aaaaaaaaaa aaaagaaaag aaataaaagt gatggggt 398



<210> 32
<211> 347
<212> DNA
<213> Artificial Sequence
<220>
<223> 2861
<400> 32

gggtgtagaa gtgtttaggt tttttttcgt tggggttggg agtttgggta ggttagtttt 60
atttttttta agttcgtttt tggttttcgg gtttagtttc ggttattatg tttcgttaga 120
ttatttttgt gggttttagt tgtttggatt tgtggaggga aaagaatgat cggttcgttc 180
gataggttaa ggtaatacgg ttgttggtat tttcggtttg tagttttaag atttttgaaa 240
gcgggtttgt agtggattta ttttaataga tggggaggga ttgagtttga ttaaagagtt 300
agaaatgatt ggagaatgta ttttttgtta ttgttgtaag gggagaa 347



<210> 33
<211> 291
<212> DNA
<213> Artificial Sequence
<220>
<223> 2864
<400> 33

tccccttcca actatatctc tcacccaaaa ataacttcta actctcgtat tcatctaaaa 60
ctcctccttc catataccaa caattaacta taacccctcc aaaaacgctc catctccaaa 120
tatactccca catccaaacc acgaacccct cacccgatca catacttcat acacctataa 180
ctccgcactc cccaaatata cctctaacgt acaactatta ccccttcccc cgattataac 240
cctataactc gccacataca actataacta aaacttccct aaaacactct c 291


<210> 34
<211> 389
<212> DNA
<213> Artificial Sequence
<220>
<223> 2867
<400> 34

aaaaccaaaa cataaaccaa aaaccaaact cgaaccgaaa acaataaccg caacgcccga 60
aaactaaacc cacgacgcgc taacaacgcg aaccgaacta cgaaaacgat cacgtcaacg 120
tccgttccaa accgactaac aatctccgtt ctacattaac gtcaacactc ccgttaaaaa 180
taatacatct ctcccatacc aaaaaaactt aaatactact aaaaaccaac cctccgaata 240
ctaccaaacc gacgctcacc cgccaccttc atcttccctt ctcctttacc ccaaaacaac 300
cgaaaatata taattaaatt ccccctaccc ataaaaaaac caaaaataaa aaactaacga 360
cctactcgat ctcaacaaac cctcctaat 389



<210> 35
<211> 272
<212> DNA
<213> Artificial Sequence
<220>
<223> 2961
<400> 35

aatggttgat gattttggtt ttttttcgtc gtcggagagc ggtgtttcgg aggcggcgga 60
ggaggattcg gcggtcgttt ttttggttta gtaggagagc gagattgtag gtatagagaa 120
cgacgagggt ttcggggtat ttgtcggtag ttatgcggtt ttcgcgtagt cgggttttac 180
gagtgggggt gagttagcgc ggggtttgga gaggggttta gggcgcgtat tcgggggatt 240
tcggtcgggg tttaggggta tagggaagag ag 272



<210> 36
<211> 371
<212> DNA
<213> Artificial Sequence
<220>
<223> 3511
<400> 36

agttagaaga ggagttagga tgggtttcgg gtagtttaat agtatagttg aagttttaat 60
tattatgtta atagtttttt ggttttatat attttatggg aagaggaaaa taaaaaggta 120
tttatttgta tattttttta tttttgatat aagaagtaga atttttttta tatgatttat 180
gtttatttaa tacgttattt tgaaatttat taataaaatt ttttaagcgt tagaaaattg 240
ttagtggttt tttttatttt tttttatttt tttttgtgtt attaattttg tttttttttt 300
ttagaaggtt gtcggaatag taaatattta ttgatatgtt ataattattg gaaaatgggt 360
attggaaaat t 371

<210> 37
<211> 457
<212> DNA
<213> Artificial Sequence
<220>
<223> 3532
<400> 37

tgttagtaga gttttaggga ggttttattt tttattttta tttaaagttt tatttgttgg 60
ggtgggggtt ttgtttggaa ggggaaggtt taaggttgtt tttagcgtgt ttttttattt 120
tgattgtttt tggcggggcg ggggtgtttt tgttatttag ttgtataacg gttaggaagg 180
gtttaaatta tttttagggt taatttaagg tcgttttttg ggtttgtata tttttgtgtt 240
gagtgcggat cgggagaggt tgttgaagat aggaggggat aaatggggga cgaaggggtt 300
cgagggaggg gattgaagga tttgggttaa gtcgggagtt ttcgagggcg gagttaaaac 360
gtatttggat tttgttagtt ttaaattttg tttttattgt tgtaagtttt ttagatcgag 420
gattttcggg ttgagggtgg ggtaaggata ggtagtg 457



<210> 38
<211> 476
<212> DNA
<213> Artificial Sequence
<220>
<223> 3534
<400> 38

tttttgtttt tatggggtgt atatttaagt agttgaaata gatagtgaat aaataaaaaa 60
ggataataat tttaaataat aatgatgtta tcggttaggt gtggtggttt atgtttataa 120
ttttagtatt ttgggaagtt aagttaagcg gattatttga ggttaggagt ttaagaatag 180
tttggttagt atggtgaaat tttattttta ttaaaaatat aaaaattagt tagatatggt 240
ggtatatatt tgtaatttta gttatttggg aggttgacgt aggagaattg tttgagttcg 300
ggaggtggag gttgtagtga gttaagattt gataggtttt tagtattatt gtattttaga 360
ttggttgata gagcgagatt ttgttaaaaa aaaaaaagtt ataaatagat tttaataggg 420
taatatgata gggagggagg gataggggag tagggtggtt aaggaaggga tattta 476



<210> 39
<211> 458
<212> DNA
<213> Artificial Sequence
<220>
<223> 3538
<400> 39

tgggtagtat ttttgttggt ttttttttat attataaggt tacgtagagt tggcggaggg 60
ttatggtttt atttatgtta ggtgttttta atttggtaag gaaatgtaat ttacgtgaat 120
tttaataggt agtgaagtat cgtttttttt tgattttagg tagggtgaag aaaatgggat 180
agtagtacgg ggtgcgggta taaacgtata attttgtttt tttagacgta gagttgtggg 240
gttgtgagaa tgttaggagg aggtaagaaa gggcggtttt atggggggtt tgtagggtgg 300
gataagttta agaggttttt atatttaggt ttggtggggg aggtgagttt ttggtttatc 360
gagggggttt ttttttgttt tcggaaatat tgtagttttt atttttatcg ttttttcgtt 420
gcggggattt aggggcgtga ggatgagaga gtttttag 458

<210> 40
<211> 405
<212> DNA
<213> Artificial Sequence
<220>
<223> 3540
<400> 40

agtggtttag gagtatttgg ttattttcgg gaaaaatcgg tttggtaaag gttttttcga 60
gggtacgcgt ttttcggata gtgaggtagg atttaaattt tttcgttaat attatatttt 120
tcgtattttt gtagtgtttg tatttttagg ttttattatt ttttcgtatt ttttagggag 180
aagttttcga cgttttattt tttttggaag ggtgttgttt ttagagattt ttaggttaat 240
ggtttaattt tagtgttttt aggggagagg ggggtgtaga aaaatagttt gggttataaa 300
agaggtgcga gggttgtgag atttcggagg tatcgacggg aagcgagacg gagaatagga 360
gggtaggacg ggttggaggt gggggatatt gtagatggag ggagt 405



<210> 41 
35 <211> 2501 

<212> DNA 



1386-WO 



78 

<213> Homo Sapiens 



<400> 41 



ccagttccag tcccgggtcc tgtggccgcc ctgccggcga ccctgcggag agcgagtctt 60

agatacccag tccccagccc cgagttgtta ttccctcgct gtagttaaga aggaggagat 120

caattaaggg catcttagaa gttaggcgtt cccgctgcct cctttgagca cggaggccac 180

caacccccta gggggaagag atgtagcgcg aggcaggggt gtcgtgctaa gaaatttcga 240

cgcttctggg gactgaggac aaaggtgcgg acacgacccc ggggtacctg gagttccgtg 300

actcgcgcca cggacggcac acctaggggc taatttctgc tctgcctcaa agaacctcaa 360

gctagagtcc ttgcctccgc ccacagcccc gggatgccgc tgctgcgctc accgcacagg 420

cagcgcccgg accggctgca gcagatcgcg cgctgcgcgt tccaccggga gatggtggag 480

acgctgaaaa gcttctttct tgccactctg gacgctgtgg gcggcaagcg ccttagtccc 540

tacctctgct gagctgaacg ctcaggcaca gtggaactga aacccggttc tgcgggatgt 600

gagagctgtt gaggtcacgc gtaattgggt gtgatggagg gcgcctgttc gtgatgtgtg 660

caggtttgat gcaagcaggt catcgtcgtg cgagtgtgtg gatgcgaccg cccgagagac 720

tcggaggcag gcttgggaca cgtttgagtg aacacctcag gatactcttc tggccagtat 780

ctgtttttta gtgtctgtga ttcagagtgg gcacatgttg ggagacagta atgggtttgg 840

gtgtgtgtaa atgagtgtga ccggaagcga gtgtgagctt gatctaggca gggaccacac 900

agcactgtca cacctgcctg ctctttagta gaggactgaa gtgcgggggt gggggtacgg 960

ggccggaata gaatgtctct gggacatctt ggcaaacagc agccggaagc aaaggggcag 1020

ctgtgcaaac ggctcaggca ggtgatggat ggcagggtag gaagggggag gtccagaggt 1080

ctggatggag gcttccgcat ctgtaccttg caactcaccc ctcaggccca gcaggtcatc 1140

ggccccctcc tcacacatgt aatggatctg aagagtaccc cgggacagtc cggggagatg 1200

gagattcgga aagtatccat ggagatctta cagaatcccc tgtgcggacc aggaaactct 1260

tgtagatccc tgcctatctg aggcccaggc gctgggctgt ttctcacaat attccttcaa 1320

gatgagattg tggtccccat ttcaaagatg agtacactga gcctctgtga agttacttgc 1380

ccatgatcac acaaccagga attgggccaa ctgtaattga actcctgtct aacaaagttc 1440

ttgctcccag ctccgtctct tgtttcccac gagccctggc cctctgtggg taataccagc 1500

tactggagtc agatttcttg ggcccagaac ccacccttag gggcattaac ctttaaaatc 1560

tcacttgggc aggggtctgg gatcagagtt ggaagagtcc ctacaatcct ggaccctttc 1620

cgccaaatcg tgaaaccagg ggtggagtgg ggcgagggtt caaaaccagg ccggactgag 1680

aggtgaaatt caccatgacg tcaaactgcc ctcaaattcc cgctcacttt aagggcgtta 1740

cttgttggtg cccccaccat cccccaccat ttccatcaat gacctcaatg caaatacaag 1800

tgggacggtc ctgctggatc ctccaggttc tggaagcatg agggtgacgc aacccagggg 1860

caaaggaccc ctccgcccat tggttgctgt gcactggcgg aactttcccg acccacagcg 1920




gcgggaataa gagcagtcgc tggcgctggg aggcatcaga gacactgccc agcccaagtg 1980

tcgccgccgc ttccacaggg ctctgctgga cgccgccgcc gccgctgcca ccgcctctga 2040

tccaagccac ctcccgccag gtgagccccg agatcctggc tcaggtatat gtctctccct 2100

ccctctccct ccattcgtca ttttctcact ccctttcctc ctctccctct ctctccgtta 2160

gtctcttcat cagatagtct ctgttagtcc gcgatttata ccaggctcgt gccctaggtt 2220

ggatcggaca gtctcaatcc cccggctcgc tcttcctgct cggctgcgga ctccagtctt 2280

actctctcgc actgcacaca ggcttaggcc agtctcggga cactcaggct ccccagggac 2340

cgcgcacaga gcctgaggca agagaaactt tccgcagacg gtgcgatcag ggacggcgtc 2400

tggagcccag cagtcccagg gaaattggtt cagaacctgg aacagagcgg atgggtggca 2460

aataggcacg acgactgagg gacaagcagc cctaaactgc a 2501


<210> 42

<211> 2501

<212> DNA

<213> Homo Sapiens

<400> 42














agatttactc aaatttaaga atgagaatac aaatccacat cttgaagtgt ttcacagaaa 60

ggtctatctt aatgtctgga gtatatattt caatgaacat tcattttatt ttatttctct 120

ccattcctga atcaagcaat cttgaatcta aagttgctat gattagcact gaaaagacca 180

ctggactatt aattgtgtga ctttgggaca gtaactttct gcaccttagt ttgtttacat 240

gttatacatg aaggttgaag tctgattctg ctctgtgact atcattctaa acatctgatg 300

aaatcaaatt tcagtgtttg gaatggtagt acaataaatt tactaagaat aaataattca 360

ctgcaaaaac acattgattt ccaaatgatg taactgacag ttatattact gcagagggct 420

gataaataac aaaagaaatg aaagatgcac atggtgagaa ctgaaattat cctgacaagt 480

cttctacctg tttatcactt aaaatcaatg accatgctga atgcctacaa attacaaaat 540

ataaaagaaa tcttataaat gcgcatgtac aggagtctaa gttactaaaa gttttaaagc 600

ataagtttaa accaaactaa tcaaagaagt tgagaggaaa aattggcttt catctttaat 660

cactactgtt ttgaggtcct atgtttaata taattttcta agtagaggct tcagagagaa 720

gagttgtgag gatactttca tatttgtgta gaaggaaaag tttgccatcc attctagtat 780

ccctagtgtt atactgatgt gcaccttgga tttattttgt tcctattgta taaactcata 840

cttgacttca aagaaaagga aaatccaaag tccctctttt ctaaggggac agaaatcctt 900

tgtgtcaact gtttgaccct tttctctgta aggtcctatt ggaaatcttt tgtaacacaa 960

tgcaggggac tcttccatgt gttgatgctg tttacacagt ggggtgggcc tgactgaaga 1020

aaaaaaatcg catatacgca tgaaagatta tggtcttatt tccggaaagc atgaaaggtg 1080
attgatactt


ccaagaagtc 


cctgttactc 


tacttggaaa 


gactgaagga 


aaggaagaac 


agagatttgt 


ggcagaggaa 


aagtattctc 


aatttcctgt 


ccagcctttt 


attaccaggg 


ttccctggga 


gtgcagtggg 


gaattcttac 


ttttctcagt 


gtctctttca 


ggttggcagg 


gacaggcaag 


ctctgtgggc 


gcgtaaacac 


gttttgtgtg 


ctcccgagaa 


gaagtgatcg 


taagagcctg 


ggggctcctt 


tcccctaacc 


gggggtgggg 


aaggagtggg 


agggcagtgg 


agtccctgaa 


tggggagcgc 


tgctgtcccc 


acgccaagag 


gtgggagtgc 


ctggggaggg 


agctgccttt 


ccgcccctcc 


gcctgctttc 


tgcggagcgc 


ttttggagcc 


tgtgaatgaa 


ctgagtctcc 


tcctcggctc 


tgacggtaca 


gcatttgaac 


ttgcaggcga 


gctgccccga 


gacgcaacag 


ccgagaacat 


taggtgttgt 


agccccgcat 


cctcccgcat 


cttccagcac 


ccaccacgct 


tcctatgtga 


cccgcctggg 


gtgaattttc 


cccccaaact 


gcaataagcc 


ccccccttaa 


aaaacaaaaa 


caaaaaactt 


gaagcaattg 


gtagaaggct 


ttaattaagg 


gtaggcactt 


aaaaaatgca 


ggtaactttc 


ggaccttggt 


gttgaaaaag 


attcagacaa 


<210> 43

<211> 2501

<212> DNA

<213> Homo Sapiens

<400> 43






tgtgggtcat 


taatgcaatg 


ttatttaaga 


cctgtaatcc 


cagcactgtg 


ggaggccgag 


agaccagcct 


gaccaacatg 


gtgaaaccac 


catagtcaca 


tgcctgtaat 


cccagctact 






aggaaaatta 


tcaaatattc 


tactcagaga 


1140 


gaagaaagca 


gaatctagac 


ttatgtgggg 


1200 


tttgaatccg 


acaagggatt 


tgcctggggg 


1260 


tcttttgaag 


ccgggctccc 


cattgggcag 


1320 


actttccctc 


taggtccccg 


aaggatctcg 


1380 


agccttgagc 


ctgacacttc 


cctttgatgg 


1440 


gctgtaacca 


agttctttgc 


tgattttaca 


1500 


tactcaattg 


tctattgctg gcctgccccc 


1560 


cagaactagc 


tgcacggggg 


gcggggaaat 


1620 


tttccgcgag 


cagagcgatg 


ttactgagtg 


1680 


aagccgattg 


gtacttcttg 


tcaggaagaa 


1740 


aggcaggcgg 


tccctaccgc 


aggcgcgggg 


1800 


caagcctgga 


ctcttaggag 


tggctgaagc 


1860 


ccctcctcct 


ctccctcctc 


cttcttctcg 


1920 


gtgatataat 


gatgatgggt 


gtcacaaccc 


1980 


gcctttctgg 


ggaagaactc 


caggcgtgcg 


2040 


ggacaggagc 


tgggaccaag 


atcttcggcc 


2100 


cgtcccgcac 


cctccgcatc 


cttccccggg 


2160 


caacgccgaa 


cccagtcgcg 


cagcgctgca 


2220 


gccttccaag 


gtaatcacgt 


ttcttttgtt 


2280 


atagaaaaaa 


acccgcgagc 


ttagaaaaaa 


2340 


caaagagctg 


taaggcgaag 


ttaagaaaat 


2400 


ataagggctt 


ttggggagag gcatacagag 


2460 


aagaaaccca 


g 




2501 



ctaggatttg 


gctgggcgca 


gtggctcacg 


60 


ccgggaggat 


cacctgaggt 


caggagttca 


120 


gtctctacta 


aaaatacaaa 


attagccggg 


180 


gggtagcctg 


aggcaggaga 


atcgcttgaa 


240 
cccgggaggc ggaggcggag tttgcagtga gccaagattt cacaactgca ctccagtctg 300

ggccacaaga gcgaaaaccc gtctcaaaaa aaaaaaaaag actaggattt gacataaggc 360 

ctgaggggta ttcttttgtt ttgttttgcc ttgttttcaa gaggccaaaa tcttcacagt 420 

tgaaaatttc tgttgaacca cagagatttg aaccaactca gtttagaaag cctggggatt 480 

tgaacaacgg tatggatcgg aaatctcttc atctgtcagt tttcatcatt ctaggcagta 540

aaatagattt ccctttagga gcttttcacc gtttggggtt ctccagcagt gggatgtggg 600 

gaatcaaccc ttcttcgtct ccacccaaac attaggtggg agcaaggggt gggaagtaga 660 

gaaagtggat agaggtctcc agtggatatg ggatctttgt gtagaccagc acagtcctca 720 

gaaatctcat gcaagcaaca taggtactgt tatattttct agtggccacc ttttaaaaag 780 

taaacaggtg aggccgggcg cggtcgtcac gcctgtaatc ccagcacttt gggaggccca 840

ggcgggcgga tcacgaggtc aagagatgga gaccatcctg gtcgacacgg tgaaaccccg 900 

tctctactaa aaatacaaaa attagctggg catggtgacg cgcgactgta gtcctagcta 960 

ctggggaggc cgaggcagga gaatcacttg aaccctggag gtggaggttg ccacgctcca 1020 

ctacactcca gcctggcgac agagtgagac tccgtctcaa aaaaaagaaa gtaaacaggt 1080 

gaaattaatt ttaataatat attttgttta acccaacgta tccaaaatac tatcatttga 1140

aagtgtaatg aatataaaaa tattcatgag atatttttca ttctcatatc catactgtct 1200 

tggactctaa tgtgtatttt acacttacag cacaattaat ttgggactag ctacatttca 1260

gctcaacaat agccaatagc atatgggata gcgcaaataa actctgcgtc tctgttgctt 1320 

ctttgggtct cggagacctc aaccctttct tcagattgca aaccttcttg ccttcaagcc 1380

tcggctccaa caccagtccg gcagaggaac ccagtctaat gaggtacgct cccttcctgc 1440

cattctctat tccattaacc tgtttcgtgg taaacgtagg actgatcctc caaaattacc 1500 

ttattaatta gcttacatat ttattatcta tctgtcccac cagaatgcag gtttccggaa 1560 

ggcagggatt taaaaaaatc tgttttgttc tatgtgattt tcccatacca agcaccgtgc 1620 

ccggcacaag ctgggatccc agtacacatc tcgggacgga agaaccgtgt ttccctagaa 1680 

cccagtcaga gggcagctta gcaatgtgtc acaggtgggg cgcccgcgtt ccgggcggac 1740

gcactggctc cccggccggc gtgggtgtgg ggcgagtggg tgtgtgcggg gtgtgcgcgg 1800 

tagagcgcgc cagcgagccc ggagcgcgga gctgggagga gcagcgagcg ccgcgcagaa 1860 

cccgcagcgc cggcctggca gggcagctcg gaggtgggtg ggccgcgccg ccagcccgct 1920 

tgcagggtcc ccattggccg cctgccggcc gccctccgcc caaaaggcgg caaggagccg 1980 

agaggctgct tcggagtgtg aggaggacag ccggaccgag ccaacgccgg ggactttgtt 2040

ccctccgcgg aggggactcg gcaactcgca gcggcagggt ctggggccgg cgcctgggag 2100 

ggatctgcgc cccccactca ctccctagct gtgttcccgc cgcgccccgg ctagtctccg 2160 

gcgctggcgc ctatggtcgg cctccgacag cgctccggag ggaccggggg agctcccagg 2220 

cgcccgggtg agtagccagg cgcggctccc cggtcccccc gacccccggc gccagctttt 2280 

gctttcccag ccagggcgcg gtggggtttg tccgggcagt gcctcgagca actgggaagg 2340

ccaaggcgga gggaaacttg gcttcgggga gaagtgcgat cgcagccggg aggcttcccc 2400 
agccccgcgg


gccgggtgag 


aacaggtggc 




cgcgaggatc 


tggagcgaac 


tgctgcgcct 




<210> 44

<211> 2501

<212> DNA

<213> Homo Sapiens

<400> 44










gatgtgaaaa 


gagaaataat 


tgaaaaagac 




tttaaagaaa 


caacattcta 


gcacaccttt 




gcaccagtac 


ccctgagtaa 


ccgaaagggc 




tgaacattct 


gtcttccagg 


acctgccttg 


15 


gtgttgtata 


cattaggggt 


agttgggtag 




aaaaacactg 


gggagtgaac 


ctttccatta 




tatatatata 


tatatatata 


tataaattca 




tggaaaacgc 


tatttccatt 


tttttttttt 




cttaaatcca 


tgctggctct 


tacaaatgtt 


20 


aaacggagag 


gctaaacaat 


ctgcaggcga 




tgcctgtgtt 


ctctgcttgg 


ctaggaacac 




aactctgttg 


tggagacatc 


tgcccctggg 




catcttactc 


catgtctcag 


agagctctga 




ctaaaaacct 


aatgatgaat 


aaaaataagt 


25 


aagctaccag 


gttaaatgac 


tgaaattcct 




gaaagggggc 


ccaatactga 


ataccaggaa 




tgggatccgg 


cgttcctatt 


tcatccgaat 




ttgcacacca 


ggtacccact 


ccctggtgtc 




gaatggtgag 


cccatgagcc 


tgctggaccc 


30 


tgccgtattt 


ctagatcaaa 


cctgaacccc 




tgaaagttcc 


aacattttac 


agggaagaag 




atgttataca 


gccctggctt 


ccactgacgc 




atctgttgcc 


accaaatgga 


aaacgtaaac 




caacacaagg 


aacaagcaga 


acattcttct 


35 


acagacagca 


ctgatgttag 


atgtacgtga 




aggcagagag 


gataatatag 


ctccagccta 




gccggcccga ccaggcgctt tgtgtcgggg 2460 
cggtgggccg c 2501 



tggagtacat 


atactatcta 


cagtgtctgt 


60 


ctacccttga 


ctaagattac 


tgtaatgaga 


120 


attttggaaa 


ctgagctttt 


ggtgtttata 


180 


atttattcaa 


gactcatact 


gctgtatatg 


240 


cagtaactga 


tatagaaaat 


tttaaatgta 


300 


tatatatata 


tatatatata 


tatatatata 


360 


catcaggatg 


agtttctgtt 


taggcaatgt 


420 


aacaaatatt 


taacaaacat 


ttataaggca 


480 


gactcatttc 


tcataaccac 


cttggggtag 


540 


tgcttcacta 


ctaaatgcag 


gtggcagcct 


600 


aggtcttacc 


tattgagctg 


ggctgtgtag 


660 


gcagaagcct 


ctgctttttc 


cccctcctcc 


720 


atcccacttg 


gagaatcaca 


cttaaaccct 


780 


tctctagaac 


ttctggagaa 


aaaagtaata 


840 


gagagaaaac 


aacatgtgtg 


tgtttctcta 


900 


gtcctatagt 


aaatggaatg 


tgactctatg 


960 


gcatgtctgc 


tgcttcagtg 


ggaagggtgc 


1020 


atgtgctatg 


cagtccaaag 


acagaaccag 


1080 


agcccctccg 


aggtccggag 


tgacaaccag 


1140 


tcctacaggg 


aaaagatttc 


caggggattt 


1200 


gaagataagc 


aggatatgaa 


agaagagttc 


1260 


taacactgga 


ttcagctttt 


gacactgata 


1320 


aagatattct 


aagtgtggtt 


agagaatatg 


1380 


ctggaatctg 


acataatgga 


ctgtactttc 


1440 


aataggctaa 


actgaaaata 


agaaaggctg 


1500 


tctcccagca 


ccttgttaat 


ttctctcaac 


1560 
ctccagccac aaatccgaga cacaacgctc ttcctccaaa gaggtcgcgc cttctctgtg 1620

gtggttctca gggatccgcc ccagctcctt ctccgttccc agccccacac actgggatca 1680

ccaggcaccc aagatcccac ctctcaggtg gtatcttcag cgcaggctgc cactcagccc 1740

ccctccaggg atctggggca gaaggcgaat atcccagagt ctcagagtcc acaggagtta 1800

ctctgaaggg cgaggcgcgg gctgcatcag tggaccccca caccccaccc gcaccccaag 1860

cgctccaccc tgggggcggg gccgtcgcct tccttccgga ctcgggatcg atctggaact 1920

ccgggaattt ccctggcccg ggggctccgg gctttccagc cccaaccatg cataaaaggg 1980

gttcgcggat ctcggagagc cacagagccc gggccgcagg cacctcctcg ccagctcttc 2040

cgctcctctc acagccgcca gacccgcctg ctgagcccca tggcccgcgc tgctctctcc 2100

gccgccccca gcaatccccg gctcctgcga gtggcgctgc tgctcctgct cctggtagcc 2160

gctggccggc gcgcagcagg tgggtaccgg cgccctgggg tccccgggcc ggacgcggct 2220

ggggtaggca cccagcgccg acagcctcgc tcagtcagtg agtctcttct tccctaggag 2280

cgtccgtggc cactgaactg cgctgccagt gcttgcagac cctgcaggga attcacccca 2340

agaacatcca aagtgtgaac gtgaagtccc ccggacccca ctgcgcccaa accgaagtca 2400

tgtaagtccc gccccgcgct gcctctgcca ccgccggggt cccagaccct cctgctgccc 2460

caaccctgtc cccagcccga cctcctgcct cacgagattc c 2501



<210> 45 

<211> 2501 

<212> DNA

<213> Homo Sapiens 



<400> 45 



ggcgacagag caagactccc tcttaaaaaa aaaaaaaaaa aaagattctg agtcaaagtg 60

ctcaagttga atgcattttg tcatccacaa gacaaatcgt gttaacccct tgtggtttac 120

tttatctata aaatagagat aacaatagtt cctgcttcta gggttgttgt gggaattaaa 180

gacttagaat aatgttcagc ctctaatcag tgctgtcaca actgtctgat acaattgtat 240

tatatttgtg tactttgtag attgatatta aatcatactt ttaaaaatag gtgcttaatg 300

ttccactcaa ttaccttaaa acatgtttaa ttatgtctct atcctactct tataacactt 360

ctataaaaac tttttacata tagcgtccac ttttggttca gtttcttagg aaaataactt 420

tgagagtcag ctatctgaac caaagaaaca ttaacattac cagactatat tgggattttt 480

gagactggct tttatcaatt ctttagctac gggctcttgt catcatctct accagtgacc 540

taagtgtcaa acccaaatgc cttgtatctg tcccattaaa gagatgcagc atctgctcct 600

ttcttactgt ttccatttcc tctgccatgc ctcctcttac aaccataaat atccaggtct 660

cttaggtttt aaacggggca tctctcaacc cccacattct tttccttggt tattcccttc 720
cctccaacag


ttcaattcac 


ctagatcccc 




gcgcctcatc 


attacaatgg 


tacattattc 




aaactgaaaa 


tctgagcgtt 


catccctggt 




tccagggtca 


tgtaacctta 


aaaaattttt 


5 


ccttatctct 


tccagcagct 


gttccaaagg 




ctccaccgaa 


gcctccaccc 


agttgccaat 




gttgacatac 


ataaaatttc 


taagacaaaa 




aactgcatac 


agatcataca 


gatccataat 




actttccatt 


taactaacat 


ttgcactggt 


10 


ccacattacc 


ttctactgaa 


gaggttgtgg 




tacaagttag 


agaaacagcg 


ttactcgaaa 




cctgacaaac


ggagcgctgt 


gggtaggggt 




cctgggcgag 


ggcgccgcag 


ggcaaagacc 




agcaactggg 


cctcccgcgc 


agcagaaaag 


15 


cagcgcagac 


cgcaagtctg 


gccccatcct 




ccaggtaccc 


cgaccgtggg 


cagcctgcgc 




gatacctgag 


cggtggccag 


ggcaggtccc 




cagcgacgat 


gcagtttagc 


gaaccaacca 




tcgttacagc 


agaacgcgcg 


gtcaagtttg 


20 


tccctatttg 


tgcaggcgag 


gccccgcccc 




gctcgcgccc 


gcagacgcct 


gggaactgcg 




gccgccgggc 


tgccatcctt 


gccctgccat 




cgctgcctcc 


agctcagccc 


ctgcgaggca 




gggaagcctg 


aaatccacct 


cctcctccac 


25 


tgcagcggcc 


gcagcgcccc 


cagcgcccgc 




aggttctgtc 


tgggactggg 


cagggccatc 




gcgggcggag 


gcgtggaccc 


tccgcccgat 




aagaagggga 


aggtgggaag 


agcccagccg 




ttagtacttc 


cgtttgagga 


gataggcaaa 


30 


gacaggaact 


caggtctcct 


aactcccatt 




<210> 46

<211> 1092

<212> DNA

<213> Homo Sapiens








acgcctgaaa 


ttatcctaga tgtcctagag 


780 


tccactcctt 


tacatgtcac 


gccagctttc 


840 


gcatcacctt 


taaattccag 


atctccaaaa 


900 


accctctctt 


ctccactgcc 


cttgttcagg 


960 


cctactctgt 


tttcctttcg 


gagtgctaac 


1020 


tctgccccat 


gcctgataat 


ttgctcgtgc 


1080 


attttttaat 


aatggtaaat 


gaaccttggg 


1140 


aagagaaaag 


gtcccagatt 


aacacggaaa 


1200 


aaacttcatc 


aagcaagacc 


ctacttaatc 


1260 


tcattctctg 


gaaatatctg aattcattcc 


1320 


cattatccct 


tgggctcgag ctctaaggca 


1380 


gaggtgtttt 


ctccagggct 


gggactttgc 


1440 


tcaccgggca 


gcagaatccg ggcagaaatc 


1500 


gggaatccag 


tcggggccca cccttcctgc 


1560 


ctcgccggga 


gtcggcctgg cgcgtcccgc 


1620 


ccgtttgggt 


cccatcgccc 


cggcccggca 


1680 


cgttcttgcc 


gatgcccatg 


ttctgggaca 


1740 


tgacagcagc 


gggaggacct 


ccgagcccgc 


1800 


gcgcgaaatt 


gtggccgccc 


cgccccctcg 


1860 


cccgccccgg 


cgcacgcagg 


gtcgcggcgt 


1920 


gccgcgggtt 


cgcgctcctc 


gccgggccct 


1980 


gtctcgccgg 


aagcctgcgt 


cgggcggcct 


2040 


agcggttttg 


agccgattct 


tccagtctac 


2100 


aggtgcagcc 


gaccaggtgg accctggcgc 


2160 


cttcccgccc 


cagctgccgc 


cgcacgtagt 


2220 


ggggctgggg 


gggcggggct 


tgtgggtaag 


2280 


gatagggctg 


gaggaggaag gggcgggctg 


2340 


gggctacaaa 


ttgggtgaag 


cgctgaggtt 


2400 


ggttatgcag 


gtttttaatg gcaggcctga 


2460 


ctgatgaggg 


g 




2501 
<400> 46



aagcttcccc ttcatcatcc aagaaggcat tcaggtcttt ctgtgctagg ccccaggtaa 60

agtgctggac tacccagtaa ttgggttcag tagcaggatg gcctcagatt gaggtcccag 120

ggccaaagga ccactcctct cctcagcgct ggtccgggaa aggcaagctc cgggcgggag 180

cgcacgccgc gcccccgaag cctggctccc tcgccacgcc cacttcctgc ccccatcccg 240

cgcctttcca ggtcttctcc cggtgaaccg gatgctctgt cagtctccta ctctgcgtcc 300

tcggccgcgg cccgggtccc tcgcaaagcc gctgccatcc cggagggccc agccagcggg 360

ctcccggagg ctggccgggc aggcgtggtg cgcggtagga gctgggcgcg cacggctacc 420

gcgcgtggag gagacactgc cctgccgcga tgggggcccg gggcgctcct tcacgccgta 480

ggcaagcggg gcggcggctg cggtacctgc ccaccgggag ctttcccttc cttctcctgc 540

tgctgctgct ctgcatccag ctcgggggag gacagaagaa aaaggaggta gaatggatcc 600

ccttggcctt cccctgtggt cgggggcggg ccagggtggg ccgcgttgcc caggcagccc 660

tgccgtgttg ctaggcagcc tggtcgccgg cgtgggcgat gccggcgctg gggcgggagc 720

cgcgagggtg ggaggccctg gggcgtttcc gggacgtgga gttagcaggg ttctgacttg 780

aaaaacgacg gcaaagcgtg ttcttgactg cttctgagca cctcacacct ttcagaccca 840

gggcgccttt attcccagct ggaagcccag cttagagcaa tggtgccact aaaaggggtg 900

tgttggatgt gaaaataccc tttggaagta tttataagcc tgcaggaaat atgttttcct 960

tattttctta ctctgctccc ttcattaccc atttcaagaa gcaacagaac ctgtgcagag 1020

tgtgttttaa gttacactgt atgtttattt ttgtttatgt tgaactcggt gtatacttgt 1080

gagaataagc tt 1092



<210> 47 

<211> 2501 

<212> DNA

<213> Homo Sapiens 



<400> 47 



cgaaatgaaa cctcgcccag gaggccgcgg acctggacac ccggcgccac ctccttcacc 60 

tctgacccag gtttcctccc ggcgctgcga gctcccgggg aagggttaga gccggcagcc 120 

ctccccagcc cggggagggg agagggttat gcgaccccac ctctggctag ggccggggag 180 

gcctttgctt cccgggagcc ctgcccgggc tccttggtcg cagggctgct gggtcccagg 240 

caggaacgag agggtgaggc ccacatgtgg cccggcggcc cagggcggct tgcagcgtcc 300 

tcactgtccc ggctgccagg ggctgcggcg acgcggccag tcagcagcga gttcaggtcg 360 

cgcagatttt attgatgagc tctgactttc agcactttcc ctaagtcaag aagagtctag 420 



1386-WO 








cgtacccttc ggctgcttca tttcagcctc cctgcctcag ctcttcagcc ctattccccc 480

tcgccctgtc ctggggtgtg tacagcagcc caggccttcc ttctccttcc cggctccgtg 540

gcccgaagcc gccgagagag ctcgggacag cgcaggacca ggcagccgct cgctctcctg 600

tcaccttaac tgcaggctcc gaggggcgcc tttggagtgt actgaggtgt gtcctaatcg 660

tgcggcattc aacaaatgga cttctggtgt gtggtcagaa gagaaaagcc atttacttac 720

tttcctcccc ggttttctgg caacagctga aggggagctg cctccgtgga ctgagcagac 780

ccaggagagg gagtcgtggt gcggagacac acgcaccaca cacagatgac cggtggcaca 840

cacgacacac gctgacatac cgacatcgcc agtgggacac acacacacac acacacacac 900

acacacacac acacagagag agagagagaa tccctcccag cattggtcat ccgccccccc 960

acccaggctt ccactccccc tcccctctta tctcccctgg cttcccctcc tctcgggcgc 1020

tgcgaaaagc agccgcactt agtcaacaaa tggcacgtgg gagaagttgg tgagtgtttg 1080

gtgaggactc ttcagggctt ttcacaagaa ccctctgtac acaaagtaag tggcgtgttt 1140

actcgggcct ctccagccag agctgtgcct ctgctccgct gcgcaccgcg gcttccgaaa 1200

ggagaaagga gagaagaaag ggcggggaga gcggggtgga ggatttggac aggccctgga 1260

ggcttgggct ggggaggcct ctggcctcgt ttagttctcg gcccggcaac ctcctctcgg 1320

cctaggcttc gccgcggcct ccgcagctgg aatggagctg ccaggaccca gtgacgctcc 1380

cgcccctttc ctcttcttcc aaggggccag gtgggctggg gtgcggccgc cgctgtgctc 1440

tgtgtcttgg ggccccggct gggatggggt gggggcgggc gggggcgggg cggcaggcca 1500

cgctgtcctg gagttggcaa gaaaggacag cacagaaact tgcaccctcc gaggactggg 1560

agtcccgagt ccagcttagg gggagtgggg gcgcgacccc caacccagaa accttcactt 1620

gaccgctcaa gttcgcggca gcagggcggg ccgcgccgaa tctcggcgtg cgcggagcgg 1680

ggagatgcag gcgagcgcca gagcccgggc tcgggggccc tgcgccgggg agaggagccg 1740

ggacccaccg gcggagccga aaacaagtgt attcatattc aaacaaacgg accaattgca 1800

ccaggcgggg agagggagca tccaatcggc tggcgcgagg ccccggcgct gctttgcata 1860

aagcaatatt ttgtgtgaga gcgagcggtg catttgcatg ttgcggagtg attagtgggt 1920

ttgaaaaggg aaccgtggct cggcctcatt tcccgctctg gttcaggcgc aggaggaagt 1980

gttttgctgg aggatgatga cagaggtcag gcttcgctaa tgggccagtg aggagcggtg 2040

gaggcgaggc cgggcgccgg cacacacaca ttaacacact tgagccatca ccaatcagca 2100

taggtgtgct ggctgcagcc acttccctca cccacactct ttatctctca ctctccagcc 2160

gctgacagcc cattttattg tcaatctctg tctccttccc aggaatctga gaattgctct 2220

cacacaccaa cccagcaaca tccgtggaga aaactctcac cagcaactcc tttaaaacac 2280

cgtcatttca aaccattgtg gtcttcaagc aacaacagca gcacaaaaaa ccccaaccaa 2340

acaaaactct tgacagaagc tgtgacaacc agaaaggatg cctcataaag gtgagtccgc 2400

ttctttcttc tcgctttatt tttattgcaa tattcagaca ggtctccccc ttcctccccc 2460

cttccttcct cccctctcgc cggtcccctc ccccactgct a 2501
<210> 48

<211> 2501

<212> DNA

<213> Homo Sapiens

<400> 48





tgatggttgc acaactctga gtacatgaaa aatcaatgaa ctgatacttt gagtgagctg 60

tatgatactg gaattacacc tcaataaagc atggtaactg ttttaagata ggctggaaag 120

agaaagcctg aaaacaacaa taatgatatt aataaattag tttacttctc tagtctcata 180

tacttctgtg cccacacttg ctcctgttct attcataatg gtccccttgc agttgccata 240

ttatatcctg ccatttgatg cccggtgaac attctatacc tgcttcccag aattctcttt 300

acctttcctc tatctgccta acttccacat atctaaaatt aatcagagta aactatttac 360

tagaacaacc aactccaaat cctagtaacc taacatgata aaggtttgtt tctcactcat 420

atagcccctc cccagatgat cgaggggtcc aggctcctta cctctagtgg ctcccccacc 480

ttctggagtc ttctgcattc tttatacatg gttgagataa actatgagtc attagcacag 540

ctagaccttg aggtcctaca agaaaatttg caaatcattc actctgtttt gaacaaggta 600

tatttaagat gatgttaaaa tacccaatgg tcttgggtca aatacagttt atgactgtgt 660

atctaaaata tatattgcaa tattcttccc tttttctact gacttcatga atttagcggg 720

gatccatttt ataagctcaa agataattac ttttcagact aagaatattt agggtaaaaa 780

gtactgttca acatctctac tgaggatgtt atgatgtagc acactgtata agctggagct 840

aaaggaaact ttccttaaag tgctatttac taaaaattgg aacacattcc ttaagacaaa 900

tcgaagtgtg gcacacaaca tccaaacttc catcatagat acagaggtgt taccatctcc 960

cactcccaaa tttctttgtc acgctgagga tactcaagag gagcaggaca tgttggtcgc 1020

agcaggagaa acttgaaagc attcactttt atggaactca taagggagag aatttcttat 1080

tttagtatcg tccttgatac atttattatt ttaaaagata atgtagccaa atgtcttcct 1140

ctgtgttaaa tctttacaaa actgaaatct taaaatggtg acaaaaattc tacttctgat 1200

agaatctatt catttttcca attagatagg gcataattct taatttgcaa aacaaaacgt 1260

aatatgctta tgaggttcca tcccaaagaa cctgctattg agagtagcat tcagaataac 1320

gggtggaaat gccaactcca gagtttcaga tcctaccggt aattggggta gggaggggct 1380

ttgggcgggg cctccctaga ggaggaggcg ttgttagaaa gctgtctggc cagtccacag 1440

ctgtcactaa tcggggtaag ccttgttgta tttgtgcgtg tgggtggcat tctcaatgag 1500

aactagcttc acttgtcatt tgagtgaaat ctacaacccg aggcggctag tgctcccgca 1560

ctactgggat ctgagatctt cggagatgac tgtcgcccgc agtacggagc cagcagaagt 1620

ccgacccttc ctgggaatgg gctgtaccga gaggtccgac tagccccagg gttttagtga 1680

gggggcagtg gaactcagcg agggactgag agcttcacag catgcacgag tttgatgcca 1740
gagaaaaagt


cgggagataa 


aggagccgcg 




actcaagtgc 


cggacttgtg 


agtactctgc 




ctctcttgga 


gaactccccg 


agttaggaga 




tgcgctcccc 


acttgccgct 


cgctgggaca 


5 


atggaggcca 


agggcaggag 


ctgaccagcg 




ggagatccct 


ccggtccagc 


cacattcaac 




tcccgaaacc 


ccctcctcct 


tcccttttcc 




gggagtccag 


tcgtcatgac 


tgagctgaag 




ggcggcccgc 


cctcccccga 


ggtcggatcc 


10 


ttcccgggga 


gccagacctc 


ggacaccttg 




gacgggctac 


tcttccctcg 


gccctgccag 




gaccagcagt 


cgctgtcgga 


cgtggagggc 




gctggaggca 


gcagttctag 


tcccccagaa 


15 


<210> 49

<211> 2501

<212> DNA

<213> Homo Sapiens

<400> 49








taccttcata 


aaaggatctt 


tgacttggta 




accacaagtg 


ccacttagca 


actccactag 




gccccctgac 


aggcttttaa 


aaggcccgga 


25 


aaaggagact 


ggagcagtgc 


tcttccctgc 




aaccacttgc 


ctgccctgtt 


ttgtgggaga 




gaaaaaaggg 


tgttttaatt 


cctctccaga 




ttaaacaagt 


tgtgcaaaat 


ggctgcaggg 




tccggctggg 


tttatttgta 


aatactgcat 


30 


gaaagactct 


tcttcctgta 


ggttgggtca 




taaaattact 


gtattttaat 


aaaaggattg 




cttcaaaatg 


taaaactgtc 


atacttaaac 




gtgggatgtg 


tgtaggtgta 


tgatgagaga 




gatgtgggaa 


caggcaagta 


attttcacat 


35 


gtcacagctc 


tgagctgtgt 


gaccttgggt 




cagttgaaca 


gtatgaggat 


gagtcacagc 






tgtcactaaa 


ttgccgtcgc 


agccgcagcc 


1800 


gtctccagtc 


ctcggacaga 


agttggagaa 


1860 


cgagatctcc 


taacaattac 


tactttttct 


1920 


aacgacagcc 


acagttcccc 


tgacgacagg 


1980 


ccgccctccc 


ccgcccccga 


cccaggaggt 


2040 


acccactttc 


tcctccctct 


gcccctatat 


2100 


ctcctccctg 


gagacggggg 


aggagaaaag 


2160 


gcaaagggtc 


cccgggctcc 


ccacgtggcg 


2220 


ccactgctgt 


gtcgcccagc 


cgcaggtccg 


2280 


cctgaagttt 


cggccatacc 


tatctccctg 


2340 


ggacaggacc 


cctccgacga 


aaagacgcag 


2400 


gcatattcca 


gagctgaagc 


tacaaggggt 


2460 


aaggacagcg 


g 




2501 



agtgtgtgcg 


atgcatactt 


ttcatgttac 


60 


acagggcagt 


gtttcagcat 


ggggtggggt 


120 


tgccaatgca 


cattccaaca 


ctatccacaa 


180 


attgggcaag 


gagactctcc 


ctccctgcct 


240 


attacaagta 


aatgctacag aggcagtgga 


300 


gtttccttta 


tttgatgtat 


gttgcatcct 


360 


tagattggct 


ctccctttta aagctctcca 


420 


ctatccttct 


tagtgtttta ggactggctg 


480 


gtgtgagaga 


tctaaaaaat 


cattttccct 


540 


ggcaggggct 


ggaatgagag 


aaaactggtc 


600 


cagtttacaa 


aatatgcgtt 


taattatgtg 


660 


ggcaaccaac 


atggctattt 


ggggtgcaag 


720 


tggactttca 


tcctagggag 


ctgggttcta 


780 


aggtctcatc 


tccccggggt 


tttgtttcac 


840 


taacatttgt 


tccatgatat 


ttacccagca 


900 



ccatacaagt gttatttctg tcctcccagt taacactgac gtgggtagta ttatatgccc 960

attttacaga tgaggaaact gaagcctgaa gaagttaaat acttatccca gaacacacag 1020

ctggtaagtg gcagacctgg aattggaatc tagttcagtt tgattcccca acccatgctc 1080

ttgaccacta tactgttttt tcaagtccag atctgaaatc tcattttctg tgtggctgtg 1140

tgtttgggac aggggtaacc aattcctgac tactctatat gctgcataga acctggagag 1200

gatttttcaa agtaaatgaa tctcgaaagc tggattgcag agcaaacgag tgcagtcaat 1260

tcagccaggg gcttgcaaga gggagaaaga gaaaaagact gtggaatgga aagtttccca 1320

acccaagcct ttcccaaggg gtagccattc tctgttctac agtttagggc ttgcatgtgc 1380

tttttctgga gtggaaaaat acataagtta taaggaattt aacagacaga aaggcgcaca 1440

gaggaattta aagtgtgggc tggggggcga ggcggtgggc gggaggcgag cgggcgcagg 1500

cggaacaccg ttttccaagc taagccgccg caaataaaaa ggcgtaaagg gagagaagtt 1560

ggtgctcaac gtgagccagg agcagcgtcc cggctcctcc cctgctcatt ttaaaagcac 1620

ttcttgtatt gtttttaagg tgagaaatag gaaagaaaac gccggcttgt gcgctcgctg 1680

cctgcctctc tggctgtctg cttttgcagg gctgctggga gtttttaagc tctgtgagaa 1740

tcctgggagt tggtgatgtc agactagttg ggtcatttga aggttagcag cccgggtagg 1800

gttcaccgaa agttcactcg catatattag gcaattcaat ctttcattct gtgtgacaga 1860

agtagtagga agtgagctgt tcagaggcag gagggtctat tctttgccaa aggggggacc 1920

agaattcccc catgcgagct gtttgaggac tgggatgccg agaacgcgag cgatccgagc 1980

agggtttgtc tgggcaccgt cggggtagga tccggaacgc attcggaagg ctttttgcaa 2040

gcatttactt ggaaggagaa cttgggatct ttctgggaac cccccgcccc ggctggattg 2100

gccgagcaag cctggaaaat ggtaaatgat catttggatc aattacaggc ttttagctgg 2160

cttgtctgtc ataattcatg attcggggct gggaaaaaga ccaacagcct acgtgccaaa 2220

aaaggggcag agtttgatgg agttgggtgg acttttctat gccatttgcc tccacaccta 2280

gaggataagc acttttgcag acattcagtg caagggagat catgtttgac tgtatggatg 2340

ttctgtcagt gagtcctggg caaatcctgg atttctacac tgcgagtccg tcttcctgca 2400

tgctccagga gaaagctctc aaagcatgct tcagtggatt gacccaaacc gaatggcagc 2460

atcggcacac tgctcaatgt aggtttattt ttttcccttc t 2501



<210> 50

<211> 2501

<212> DNA

<213> Homo Sapiens

<400> 50

ggaggataga aatataaatt aaagaatgac acaaataatt ataaagttac agctgttaaa 60








agaaaagcat 


atggtgccaa 


gagaacgtgt 


aatacaagat 


ctactcatgg 


aggtgaggga 


120 




aagcttgccc 


atcaaagaag 


ttatgattca 


atccacgaag 


accaggagtt 


ggctgggtga 


180 




agaaaaaaag 


gtcagaggaa 


ggaagtccac 


actggggaag 


gctctaagca 


taaagggtag 


240 




gaggattaca 


gaggcatatt 


cacgaaattt 


ggagaaggct 


ttcagtaagc 


aaggagaagc 


300 


5 


caaatgaaag 


tttacgggag 


agttggaggc 


ttgaagacac 


gttcaaggat 


ctggttttta 


360 




tcttctcttt 


atctcaagag 


cagtgggaag 


ccattaaatg 


attttaatca 


gagggttggt 


420 




ataactagtt 


ttgtattttg 


aaaagctgaa 


ttcagctctc 


gtttgagaaa 


ctgagtgaaa 


480 




gagcccagaa 


cggccgtggc 


tgagggtgac 


tcgtgggaga 


ctcctacaca 


agccatggca 


540 




gtggcatggg 


ctggtggcag 


aagagggaat 


agggagaaga 


tttggaactc 


aatcttcctc 


600 


10 


cattgacaaa 


gtcactccag 


ctttggcaag 


gcaattaatt 


ggtgggaaag 


aagatgccta 


660 




gccctcctga 


tttcactgca 


ctttctgcat 


cttcaacatg 


agtactggga 


agtggcaaaa 


720 




catccagagg 


cagcttgggt 


gctaggtgga 


gcatgagtta 


aaattccagg 


atgaagcaaa 


780 




tgaacactta 


gaatgacagg 


aaagatttgg 


gagttgggtt 


tgggggaggg 


ctatttacct 


840 




ttattccctg 


gagaccctgg 


cacaaaccct 


tgcctctgca 


atcttcctct 


caggtaaagg 


900 


15 


aattcattaa 


atgaattgct 


agaagatcta 


ctgaccagag 


ggctgtacag 


aatcatatct 


960 




ttgagagtgg 


gaagtaggtt 


gatcacatag 


tttattatcc 


aatcaggaca 


tatctgaaag 


1020 




agaaaggggg 


ttctattaat 


atttaaacta 


caaaacatgt 


acaccaggaa 


tgtcttgggc 


1080 




aaatctggtt 


gccctagcaa 


gaaaggaaat 


ttgaaagttt 


atactgttct 


gctcccatgt 


1140 




taccccgttt 


gcacatgaga 


gggtaagtat 


tctctttctt 


cacctgcatt 


aagggaataa 


1200 


20 


aagcacaagc 


attcaggtga 


ctcccaaccc 


acttttaatt 


ttacagtttc 


tgctatactc 


1260 




tatacattct 


gaaaattaca 


tttcccacca 


ctatcacttc 


gtgataggtg 


atcatttaca 


1320 




attactcact 


gactcagtcc 


cgggaagagg 


cggtgcaaaa 


tgggacgctc 


tatccaggtg 


1380 




ctcattagaa 


atgcagaatc 


tctgcctgcc 


tcctagacct 


actgaattag 


aatctgcatt 


1440 




tttaaataag 


atttccaggt 


gatcaatatg 


tacattaaaa 


cttgagaaaa 


acctctagac 


1500 




ttcgacctaa 


agaaaaacat 


tttacaactt 


gacagtgtat 


gcacatacat 


acatgcatat 


1560 




agacacaact 


gaagcacaaa 


tttaatgaag 


tagaatttac 


cgttactatt 


ttatttggga 


1620 




aagaaatgtg 


ctcgcgactc 


aatagattgg 


agtattcact 


cctggatctc 


aacttgcaat 


1680 




ttgaaaacgc 


atctctaaag 


cacctaggag 


caatctgaag 


aaagctgagg 


ggaggcggca 


1740 




gatgttctga 


tctactaggg 


aaaacgtgga 


cgttttctgt 


tgttactttg 


tgaactgtgt 


1800 




gcacttagtc 


attcttgagt 


aaatacttgg 


agcgaggaac 


tcctgagtgg 


tgtgggaggg 


1860 




cggtgagggg 


cagctgaaag 


tcggccaaag 


ctctcggagg 


ggctggtcta 


ggaaacatga 


1920 




ttggcagcta 


cgagagagct 


aggggctgga 


cgtcgaggag 


agggagaagg 


ctctcgggcg 


1980 




gagagaggtc 


ctgcccagct 


gttggcgagg 


agtttcctgt 


ttcccccgca 


gcgctgagtt 


2040 




gaagttgagt 


gagtcactcg 


cgcgcacgga 


gcgacgacac 


ccccgcgcgt 


gcacccgctc 


2100 




gggacaggag 


ccggactcct 


gtgcagcttc 


cctcggccgc 


cgggggcctc 


cccgcgcctc 


2160 




gccggcctcc 


aggccccctc 


ctggctggcg 


agcgggcgcc 


acatctggcc 


cgcacatctg 


2220 
cgctgccggc


ccggcgcggg 


gtccggagag 




gggaaggcgc 


cgtccgctgc 


gctgggggct 




gggtcggggg 


ctgctcaggg 


gcctgtggcc 




cagcacgatc 


ccaccgcacg 


ttcagaagtc 




gggcgccggg 


ggtcttcctg 


gggtccccgc 




<210> 51 








<211> 2500








<212> DNA 








<213> Homo Sapiens






<400> 51 








ttcccatcaa 


gccctagggc 


tcctcgtggc 




cttggcgaga 


agcgcctacg 


ctccccctac 




caccgccccc 


ccgccgcctg 


cagagggcgc 




tctccaggct 


tcagacctgt 


ctccctcatt 




gctacccagc 


actgatatag 


gcactcagga 




tctatatcct 


cataaggctt 


acgtttccat 




tcagagtgat 


aagggttgtg 


aaggagatta 




agaaaacgtt 


agggtgtgat 


attacggaaa 




gagaagaaag 


gatccagctg 


agagcaaacg 




ttgacacaat 


gcaggattcc 


tccaaaatga 




cttgcagaat 


tttccacccc 


aaaatgttag 




gactctgcac 


cctcctcccc 


aactccattt 




gcccttactt 


gtcatggcga 


ctgtccagct 




gattggggtt 


ttcccctccc 


atgtgctcaa 




aagtctagag 


ccaccgtcca 


gggagcaggt 




cgggctggga 


gcgtgctttc 


cacgacggtg 




actgaacttg 


atgagtcctc 


tctgagtcac 




ggaaaatcgc 


tggggctggg 


ggtggggcag 




gatggaagct 


tggctagagg 


gatcatcata 




gatgatgggg 


atgttaggac 


catccgaact 




gctttgggga 


accttgagcc 


ggcctaaagc 




gcgtagggaa 


tccctgaaat 


aaaagatgca 




tcgaaacatt 


gagaactcat 


agctgtatat 






ggcgcggcgc 


ggaggcgcag 


ccaggggtcc 


2280 


cggtctatga 


cgagcagcgg 


ggtctgccat 


2340 


gctgcacatc 


gtcctgtgga 


cgcgtatcgc 


2400 


gggtgagtgg 


tccccagccc 


gggctcggcg 


2460 


ctctccgctg 


c 




2501 



tgctgggagt 


tgtagtctga 


acgcttctat 


60 


cgagtcccgc 


ggtaattctt 


aaagcacctg 


120 


agcaggtctt 


gcacctcttc 


tgcatctcat 


180 


caaaaaatat 


ttattatcga 


gctcttactt 


240 


atacaacaat 


gaataagata 


gtagaaaaat 


300 


gtactgaaag 


caatgaacaa 


ataaatctta 


360 


aataagatgg 


tgtgatataa 


agtatctggg 


420 


gccttcctaa 


aaaatgacat 


tttaactgat 


480 


caaaagcttt 


cttccttcca 


cccttcatat 


540 


tttccaccaa 


ttctgccctc 


acagctctgg 


600 


tatctacggc 


accaggtcgg 


cgagaatcct 


660 


cctttgcttc 


ctccggcagg 


cggattactt 


720 


ttgtgccagg 


agcctcgcag 


gggttgatgg 


780 


gactggcgct 


aaaagttttg 


agcttctcaa 


840 


agctgctggg 


ctccggggac 


actttgcgtt 


900 


acacgcttcc 


ctggattggg 


taagctcctg 


960 


gggctctcgg 


ctccgtgtat 


tttcagctcg 


1020 


tggggactta 


gcgagtttgg gggtgagtgg 


1080 


ggagttgcat 


tgttgggaga 


cctgggtgta 


1140 


caaagttgaa 


cgcctaggca 


gaggagtgga 


1200 


gtacttcttt 


gcacatccac 


ccggtgctgg 


1260 


caaagcattg 


aggtctgaga 


cttttggatc 


1320 


tttagagccc 


atggcatcct 


agtgaaaact 


1380 
ggggctccat


tccgaaatga 


tcatttgggg 


ccacaacttc 


cggacctttg 


tccttcctgg 


ctagatggag 


aaaatccaat 


tgaaggctgt 


gggtttgccc 


gccaggccga 


ggaggaccgt 


ttgtttggct 


ccacatttac 


atttctgcct 


cggagcagct 


cactattcac 


ccgatgagag 


gccggttcct 


cttacttggc 


agagggaggc 


gattacttag 


ttatggcctt 


tgcaaaggca 


cctccccttc 


tttgaatggt 


gtgccccacc 


taccatggcg 


tagacaggga 


gggaaagaag 


aagaatgagc 


atatctcatc 


ttcccggaga 


gaaattttga 


aagagtgcaa 


tgatgggtcg 


ctgttatcta 


gctttgggct 


aggccattcc 


cggaaggggc


ggcrccccrcacr 


gcgtccgtgt 


gttcttcctg 


gtaggaggcg 


gaactcgaat 


cgcggttgtt 


tcattccgca 


gtttcttccc 


ccgtacttac 


gtcatctttt 


tcctaaatcg 


acagcaagtg 


cacaggaaga 


tgagttttgg 


cacagaccct 


tttcatcgtc 


ccagaaacgt 




<210> 52 

<211> 286 

<212> DNA 

<213> Homo Sapiens 


<400> 52 

tttgcactag gctggaagtg gccgccagtc 
gaatcagctg gcattgccca gcgtgatttg 
aaatgggatg ccacctccgc ggggctcgct

tgcaaaacga gggagcgtta ggaaggaatc 
agcctctacc caatgctttc gtgatgctgc 

<210> 53 
<211> 1400

<212> DNA 






gtgatccggg 


gagcccaagc 


tgctaaggtc 


1440 


agcgatcttt 


ccaggcagcc 


cccggctccg 


1500 


cagtcgtgga 


agtgagaagt 


gctaaaccag 


1560 


cgcaatctga 


gaggcccggc 


agccctgtta 


1620 


cttgcagcag 


catttccggt 


ttctttttgc 


1680 


gggaggagag 


agagagaaaa 


tgtcctttag 


1740 


tgctattctc 


cgcctgcatt 


tctttttctg 


1800 


ggggtatttg 


ttttgatgca 


aacctcaatc 


1860 


ccccgggtcg 


cctgcaacct 


aggcggacgc 


1920 


tgtgcagaag 


gcaagcccgg 


aggcactttc 


1980 


aaaaaaaaaa 


agaatggtac 


gtctgagaat 


2040 


tttgataatt 


tgtcgggaaa 


aacaatctac 


2100 


agttccagac 


gcaggctgaa 


cgtcgtgaag 


2160 


ggtcctccgt 


gcagccctcg 


gcccgagccg 


2220 


tcatttctcc 


cgctgcccca 


tctcttagct 


2280 


atgcacctgc 


cgcgtaccgg 


ccactttgtg 


2340 


aggtggcatt 


tacacacagc 


gccagtgcac 


2400 


cccctaaccg 


ctccgtgatg 


cctaccaagt 


2460 


ttcatcacgt 






2500 


ccccgtgcaa 


ttccattctc 


tggaaaagtg 


60 


tgaggctgag 


ccccaacagt 


ccaaagaagc 


120 


cctcgcgagg 


tgctcacccc 


gtatctgcca 


180 


cgtcttgtaa 


agccattggt 


cctggtcatc 


240 


tgctgatcta 


tttggg 




286 
<213> Homo Sapiens
<220> 

<221> unsure 
<222> (1371)

<223> unknown base 

<400> 53 

ttccagctgt caaaatctcc

cgagaataga aaattagccc 

ctttgatcca tggtcacaac 

aactttaaaa ctcgaattta 

atccacgcat cagggagaga 

caaattctgg ccatcgccgc

acccgtggag ctcacattaa 

gaatttacct ttcccgcctc 

agtatttctt ctgttgaaag 

aggtagctat ttcattccac 

catatagagc agatatacag

tttcttaacc ttactcgccc 

tacagaccag acacggcggc 

ggccgctcag attcctggag 

ccaaggcgat cagtccagaa 

aaagacatct ggcggaaacc

gatcagacag gagagtgggg 

ttccgatttt ctcatttccg 

tttaagggga gaggagggaa 

tttcgtcaca tgggcttggt 

caattgtcat acgacttgca

ctccttcagc tccacagcca 

tgcccgccgc tgcgatgctc 

atacaggtga gtacctggcg 






cttccatcta attaattcct 


catccaacta 


tgttccaaaa 


60 


caataagccc aggcaactga 


aaagtaaatg 


ctatgttgta 


120 


tcataatctt ggaaaagtgg 


acagaaaaga 


caaaagagtg 


180 


ttttaccagt atctcctatg 


aagggctagt 


aaccaaaata 


240 


aatgccttaa ggcatacgtt 


ttggacattt 


agcgtccctg 


300 


ttcctttgtc catcagaagg 


caggaaactt 


tatattggtg 


360 


ctatttacag ggtaactgct 


taggaccagt 


attatgagga 


420 


tctttccaag aaacaaggag 


ggggtgaagg 


tacggagaac 


480 


caacttagct acaaagataa 


attacagcta 


tgtacactga 


540 


ticicici ciciy ciy u l< v« u l. Luaau 


o.y i_ci i-y l>cl l» 


rr\- a t~ nt~ etc i~ rr 


600 


cctattaagc gtcgtcacta 


aaacataaaa 


catgtcagcc 


660 


cagtctgtcc cgacgtgact 


tcctcgaccc 


tctaaagacg 


720 


ggcggcggga gaggggattc 


cctgcgcccc 


cggacctcag 


780 


aggaagccaa gtgtccttct 


gccctccccc 


ggtatcccat 


840 


ctggctctcg gaagcgctcg 


ggcaaagact 


gcgaagaaga 


900 


tgtgcgcctg gggcggtgga 


actcggggag 


gagagggagg 


960 


actaccccct ctgctcccaa 


attggggcag 


cttcctgggt 


1020 


tgggtaaaaa accctgcccc 


caccgggctt 


acgcaatttt 


1080 


aaatttgtgg ggggtacgaa 


aaggcggaaa 


gaaacagtca 


1140 


tttcagtctt ataaaaagga 


aggttctctc 


ggttagcgac 


1200 


gtgagcgtca ggagcacgtc 


caggaactcc 


tcagcagcgc 


1260 


gacgccctca gacagcaaag 


cctacccccg 


cgccgcgccc 


1320 


gcccgcgccc tgctgctgtg 


cgcggtcctg 


ncgctcagcc 


1380 
1400 



<210> 54

<211> 2501 




<212> DNA 

<213> Homo Sapiens 



<400> 54 






gataatcttt 


tcatacaaga 


tgcattctgc 


ttttgtgggc 


ctcttgcagc 


cctcaagccc 


60 


ccatctgatt 


tgtacacaat 


gatccagtgg gccagaggag 


cccagagcca 


tgagcggccc 


120 


atccctccaa 


gaactatttc 


tgactgtcca gtatcatgga 


gcaagtggaa 


agaagaaaaa 


180 


aaaaacccaa 


ttacttttcg 


aagagcaaga 


tgaatgctgt 


agaaggagaa 


ggaaggggag 


240 


ggagatggat 


gggtgccgat 


tccagaatct 


tcagatctgc 


ttggatgaat 


cattacctat 


300 


gatttgcggg 


acaagaatct 


gattttattc 


atcaaccagt 


agaaactttt 


ctttctgcct 


360 


cccaacatct 


gaaatccaac 


aaacatgtgc 


cttaggaaca 


taccggtcat 


cttttagagg 


420 


cattttatat 


acatattgag 


taactagaaa 


acactctttc 


cgtaatacac 


acacacacac 


480 


acacacacac 


acaccatctt 


gtcatacaac 


actcccacgc 


aagaaaagcg 


aaactgctgt 


540 


ttgatgaatg 


taaacacttg 


gctgtttgca 


gcagtcggga 


gtcctgccag 


gtttaagtgc 


600 


taagatggga 


ggtgaacccc 


aggggtttcc 


ccctgcccgt 


gctgagatcc 


ttatttggtc 


660 


aagcttctac 


ctatgccctg 


gcctcggagc 


gagcccgata 


gcgctggatc 


acagcagagg 


720 


gagcgaggcg 


gctgacgtcc 


catcccgaag 


agatgaatgg 


aattccagga 


agctagagtc 


780 


atgctggctt 


gggacagtgg 


cttggagacc 


agacttcaat 


gacagaagca 


ctaggcagcg 


840 


gcactcatgg 


caatgtgtgc 


acccacagaa 


atgtaaccca 


cacctcgggt 


tcaggagccg 


900 


aaaaatgaaa 


agaacgttta 


gggaggaaaa 


agggaaatac 


aataataggc 


agagagtaat 


960 


ttattactct 


atgggtctgc 


tctgtaaata 


gctgaagact 


ctggagccag 


atggttctgc 


1020 


aaattctcca 


aacaggagtc 


acgttaagaa 


gcacgagtgg 


gcacaaaaac 


tgtttttcaa 


1080 


gacacaattt 


caatttggct 


tgtggaaact 


ggatacgagt 


aagtttcctt 


aaaattcgag 


1140 


tagaaagcag 


ctgtcctccc 


cgggcccctt 


gatgagaata 


cgcacaccgc 


ccccaagcgg 


1200 


ccggccgagg 


gagcgccgcg 


gcagcgggag 


aggcgtctct 


gtgggccccc 


tggcagccgc 


1260 


ggcaggaaag 


ggcccgaagg 


cagcgaaggc 


gaacgcggcg 


caccaacctg 


ccggccccgc 


1320 


cgacgccgcg 


ctcacctccc 


tccggggcgg 


gcgtggggcc 


agctcaggac 


aggcgctcgg 


1380 


gggacgcgtg 


tcctcacccc 


acggggacgg 


tggaggagag 


tcagcgaggg 


cccgaggggc 


1440 


aggtacttta 


acgaatggct 


ctcttggtgt 


cccctgcgcc 


ccgtcggccc 


atttttcttt 


1500 


ttacaaaacg 


ggcccagtct 


ctagtatcca 


cctctcgcca 


tcaaccaggc 


attccgggag 


1560 


atcagctcgc 


ccgaaagccc 


ctgcgccacc 


ccgcgggccc 


tcctaggtgg 


tctccccagc 


1620 


cccgtccctt 


ttcgggatgc 


ttgctgatca 


ccccgagccc 


gcgtggcgca 


agagtacgag 


1680 


cgccgagccc 


gtgcgcgcca 


aggctgcgtg 


ggcgggcacc 


gacttttctg 


agaagttcta 


1740 


gtgctcccaa 


gccccgaccc 


ccgccccctt 


cactttctag 


ctggaaagtt 


gcgcgccagg 


1800 


cagcgggggg 


cggagagagg 


agcccagact 


ggcccccacc 


tcccgcttcc 


tgcccggccg 


1860 
ccgcccattg


gccggaggaa 


tccccaggaa 


tgcgagcgcc 


cctttaaaag 


cgcgcggctc 


1920 


ctccgccttg 


ccagccgctg 


cgcccgagct 


ggcctgcgag 


ttcagggctc 


ctgtcgctct 


1980 


ccaggagcaa 


cctctactcc 


ggacgcacag 


gcattccccg 


cgcccctcca 


gccctcgccg 


2040 


ccctcgccac 


cgctcccggc 


cgccgcgctc 


cggtacacac 


aggtaagtcg 


cccccggcgg 


2100 


ccgccgagga 


ccaaagctgc 


ccgggacatc 


cacctggagc 


gctgaggctt 


cagtccctct 


2160 


ggtggacccc 


ggaacctaca 


ctctccccgc 


tcgcctaccc 


cagcccgctc 


ctctcagccg 


2220 


ctggaggact 


cttcagggca 


aggctccaga 


gccatcctct 


ccagccttga 


ggttcacaaa 


2280 


ccaactcatc 


aggacacccc 


aagatttcct 


tactctctga 


agtcctcctt 


aagcctttgt 


2340 


atcagcactc 


cagggaagag 


tctgtacttc 


ccctgccctc 


cctgcaaccc 


caaactacag 


2400 


ttcctgatct 


tgctcacctt 


cgacttccca 


aaagccccca 


aattgttggt 


cttgcgcccc 


2460 


ccacacttta 


aaaccagcat 


ctctttcctc 


cacctctctc 


t 




2501 


<210> 55 














<211> 7258 














<212> DNA 














<213> Homo Sapiens












<400> 55 














ttcaatagga 


agcaccaaca 


gtttatgccc 


taggactttg 


ttcccacaat 


cctgtaacat 


60 


catatcacga 


cacctaaccc 


aatccttatc 


aagccctgtc 


aaaaacggac 


tttaaaccaa 


120 


gctgcaaatt 


ttcagtaatc 


tggccttgcc 


tttccccctc 


tgatagcacc 


atcaaacaaa 


180 


cccccttact 


gccgaaagca 


ataagcccgg 


ctttgttcca 


tccactggtt 


gtgttggtga 


240 


tatctgggga 


ctgccactga 


acagacgcac 


agagggagcc 


cctacaggca 


ggggtttttc 


300 


tgtctgtgct 


tcttgggaga 


gtatgtctcg 


tacatttgtc 


gcgtgatgaa gacttcacag 


360 


ctccatccag 


cgaccagact 


cacagctcca 


tccagctgcg 


gcaagggggt 


ctgaggcagt 


420 


cttaggcaag 


ttggggccca 


gcgggagaag 


ttgcagaaga 


actgattaga 


ggacccagga 


480 


ggcttcagag 


ctgggctgag 


gtagagagtc 


tcctgtgcgc 


cttctctcct 


ctctgcaatt 


540 


cggggactcc 


ttgcactggg 


gcaggccccg 


gcaggtgcat 


gggaggaagc 


acggagaatt 


600 


tacaagcctc 


tcgattcctc 


agtccagacg 


ctgttgggtc 


ccctccgctg 


gagatcgcgc 


660 


ttcccccaaa 


tctttgtgag 


cgttgcggaa 


gcacgcgggg 


tccgggtcgc 


tgagcgctgc 


720 


aagacagggg 


agggagccgg 


gcgggagagg 


gaggggcggc 


gccggggcgg 


gccctgatat 


780 


agagcaggcg 


ccgcgggtcg 


cagcacagtc 


ggagaccgca 


gcccggagcc 


cgggccaggg 


840 


tccacctgtc 


cccgcagcgc 


cggctcgcgc 


cctcctgccg 


cagccaccgg 


tgagtgccgc 


900 


ggtcctgaga 


tccccgggcc 


ggatgcgcgg 


cggccccagc 


tcccgagcgt 


ctgcctgccc 


960 


cgccctgggc 


tgcccgggct 


ccctgggctc 


cccggcggct 


gcacggagtc 


aaggcgcccc 


1020 
gtcccgggcg


tcccccgcgg 


gtgccgatcc 


aggctgcccg 


gagtccggag 


cccatagagg 


1080 




agagagacag 


ctggggagcc 


tggtcaccgc 


gggcatctcc 


cctgcgctgc 


agtcgcccgc 


1140 




ctggcctgcc 


ttcccgttcc 


tccgcctctt 


gccctgactt 


ctccttcctt 


tgcagagccg 


1200 




ccgtctagcg 


ccccgacctc 


gccaccatga 


gagccctgct 


ggcgcgcctg 


cttctctgcg 


1260 




tcctggtcgt 


gagcgactcc 


aaagtgagtg 


cgctcttgct 


ttgactgatg 


ctgcccaagg 


1320 




acctctgatc 


agcaccaggg 


gagaggaggg 


gctgctcagg 


gagctggggt 


ctccggattc 


1380 




catccacagc 


agggccagac 


tctccccagg 


aaatgggaca 


gggtggcagc 


ggaggcttga 


1440 




gaaccacggg 


ggttggcact 


ggctggcaag 


ggaggaagag 


ggccaccggg 


actgccccag 


1500 




cctgcgggca 


tctggtagat 


gaagcttaat 


ccatttctcc 


tggctggaaa 


ccatggtctt 


1560 




ccatttgaga 


actagatacg 


aacagggtga 


ggcgagaggg 


agagggaaga 


gtgggttttg 


1620 




ggattggggc 


cagtttaccc 


tcaccctgga 


tccctggagc 


atgggacctt 


tgatgaagcc 


1680 




tcctcccgaa 


tctcttccag 


ggcagcaatg 


aacttcatca 


agttccatgt 


gagtatccac 


1740 




ccctacaaca 


gttggctgca 


cagacaagtt 


gggaaggctt 


caggggacac 


tcccctccct 


1800 




gccctctgct 


gcagcgtgcg 


ccacccctta 


ccacttccac 


tccccctcgc 


ttaccccacc 


1860 




tttgttctct 


ccagcgaact 


gtgactgtct 


aaatggagga 


acatgtgtgt 


ccaacaagta 


1920 




cttctccaac 


attcactggt 


gcaactgccc 


aaagaaattc 


ggagggcagc 


actgtgaaat 


1980 




aggtatgggg 


atctccactg 


caactgggag 


agaaatttgg 


ggacagggag 


ggatgggtgg 


2040 




gaggcaagag 


caggcaggag 


ttaggagctg 


gaggtagggt 


gggtgacatc 


ttcatcccta 


2100 




tgtgacaagc 


ataaacacac 


acacacgctc 


acgaaacagt 


ggccacacaa 


atgtgaggtg 


2160 




gggttggaag 


gagaccctgt 


ccagtcttct 


ggcaggtctg 


aaacgacatc 


tttaaaatgt 


2220 




ccgttggcag 


ccgggcatgg 


tggctcacgc 


ttgtaatccc 


agcattttga 


gaggtcaagt 


2280 




ttgagtggat 


catttaggtc 


aggagttcaa 


gaccagcctg 


gacaacatgg 


tgtaaccctg 


2340 




cctctactaa 


aaatgcaaaa 


atcagcctgg 


catggtggtg 


gatgcctgta 


gtcccagcta 


2400 




cttgggaggc 


tgaggcagga 


gaattgcttg 


aacatgggag 


gccagatctc 


agtgagctga 


2460 




gatcacacca 


ctgcactcca 


actgggcgac 


agagcaagac 


tccatctcaa 


aaaaaaaaaa 


2520 




aaataaaagt 


tagttggaat 


gttcttctct 


ttctcatatt 


ctctcatcct 


cctgtcccct 


2580 




tgtagataag 


tcaaaaacct 


gctatgaggg 


gaatggtcac 


ttttaccgag 


gaaaggccag 


2640 




cactgacacc 


atgggccggc 


cctgcctgcc 


ctggaactct 


gccactgtcc 


ttcagcaaac 


2700 




gtaccatgcc 


cacagatctg 


atgctcttca 


gctgggcctg 


gggaaacata 


attactgcag 


2760 




gtgaggtggg 


ggcaacaagg 


accaaaagcc 


ctccctacag 


cttcccagaa 


accttgttac 


2820 




catccccttc 


tcccagaggg 


ctggccatag 


cacaagagaa 


gtgcggcctc 


tggttgagtc 


2880 




ttccctgagg 


ggaggaggca 


gggaaggccc 


tctgggttgg 


aatgacatcc 


cctatctttc 


2940 




tgtgttgtgc 


caggaaccca 


gacaaccgga 


ggcgaccctg 


gtgctatgtg 


caggtgggcc 


3000 




taaagccgct 


tgtccaagag 


tgcatggtgc 


atgactgcgc 


agatggtgag 


catcactgac 


3060 




ctgctgatga 


caggtgggtg 


gaaggggaca 


aacttacatg 


tccccttatt 


ccatcacagg 


3120 




aggactgagg 


aggtgggggg 


tgcccgagag 


ggatgctttc 


tcctacctgc 


ctccctaaga 


3180 
catccctctg


tttgtcctcc 


aggaaaaaag 


ccctcctctc 


ctccagaaga 


attaaaattt 


3240 




cagtgtggcc 


aaaagactct 


gaggccccgc 


tttaagatta 


ttgggggaga 


attcaccacc 


3300 




atcgagaacc 


agccctggtt 


tgcggccatc 


tacaggaggc 


accggggggg 


ctctgtcacc 


3360 




tacgtgtgtg 


gaggcagcct 


catgagccct 


tgctgggtga 


tcagcgccac 


acactgcttc 


3420 




atgtacggcc 


ctgggtttct 


cctcttcgac 


tcttctgccc 


caccccaagc 


acatcccttt 


3480 




ctccttccca 


gcaaagtgtt 


ccgcctcatt 


tctccctcat 


ctgcccctgt 


ccatgcgccc 


3540 




atggccttgg 


ggacaagtcg 


tgctttgagg 


cctctaggga 


gggaaggaag 


aagtggcatg 


3600 




atttcatggg 


actaagctgt 


ttgatgggta 


tcttcttcca 


cagtgattac 


ccaaagaagg 


3660 




aggactacat 


cgtctacctg 


ggtcgctcaa 


ggcttaactc 


caacacgcaa 


ggggagatga 


3720 




agtttgaggt 


ggaaaacctc 


atcctacaca 


aggactacag 


cgctgacacg 


cttgctcacc 


3780 




acaacgacat 


tggtgagggg 


gaacgcccgc 


gactactgtg 


gccataatgg 


cttggggaga 


3840 




gtgggaccca 


gggagagact 


ggagctgagt 


tgaagctgcc 


ggtggggcag 


gggtggggcg 


3900 




agggaccttg 


aagcctcgat 


atacatgaca 


aaggatggca 


gggaagagtt 


ccatgaagtc 


3960 




tgaggggcct 


ggtgctcctc 


tggagagacc 


ctgaatttcc 


ccaacaagta 


gccctcttgc 


4020 




gagtggaaac 


agccctgtgg 


gtatatggct 


tgggctggga 


aggccctgtt 


tatatgaatt 


4080 




agaaaaagac 


acaccttcct 


ttgtgggatg 


cagcctctgt 


ctgtgctagg 


atatagaact 


4140 




tggagaatgg 


agccttggga 


tggattccag 


cctaactacc 


tcagggggat 


cctctagagt 


4200 




gcagctggga 


gtttttgcag 


aaacgacctg 


tacagctgta 


tgcagtggct 


ctggccatcc 


4260 




aagccttttt 


caacacctgg 


aacaaagccc


ttggggcatg 


gggcagggga 


ggtttccagg 


4320 




tgataagcga 


ccagcagacc 


tccctggatg 


actgacctag 


ggataggcat 


agctacttcc 


4380 




tcggcacttg 


gaggggacag 


atggggaccg 


cctaaccagt 


agtgatcttt 


ctcctctgac 


4440 




cctctgtcct 


cccccagcct 


tgctgaagat 


ccgttccaag 


gagggcaggt 


gtgcgcagcc 


4500 




atcccggact 


atacagacca 


tctgcctgcc 


ctcgatgtat 


aacgatcccc 


agtttggcac 


4560 




aagctgtgag 


atcactggct 


ttggaaaaga 


gaattctagt 


aagtgacaat 


tgcgactgac 


4620 




ttagaaggtc 


ctgaggagtg 


ttttgacctg 


aaaatgagcc 


cagtgtgatc 


aagggaagac 


4680 




tgcagagtta 


gaggtgggag 


cactgaggcg 


gtggcagatg 


ggtccaggga 


tggatgaaga 


4740 




gtgttgttta 


gggagcgatg 


ggctgcaaag 


gtaaatagat 


ggtaggggct 


ataggtggag 


4800 




gtaaatggct 


cagatttgca 


tggagagaga 


ataatgggcc 


tctccctggg 


tgatgatact 


4860 




ttatggtgtc 


ccctctctgg 


cgagacgtcc 


cacgtggagg 


cagataaatc 


ttgatgcaaa 


4920 




cgcctccctg 


ttttctccac 


ctagccgact 


atctctatcc 


ggagcagctg 


aaaatgactg 


4980 




ttgtgaagct 


gatttcccac 


cgggagtgtc 


agcagcccca 


ctactacggc 


tctgaagtca 


5040 




ccaccaaaat 


gctgtgtgct 


gctgacccac 


agtggaaaac 


agattcctgc 


caggtgagtg 


5100 




ttccaagcat 


ctctctccac 


ctcttccata 


tctccccaga 


gctcctgggc 


ttgttccagc 


5160 




cagcttaagg 


gtgtctctct 


ctagccaaag 


ccctaagtag 


ccagaatcag 


gagctcaggt 


5220 




ctttgagggt 


ttaaaccagt 


ccttatgtgt 


ttgccagaca 


ttaccaaaaa 


aatcccagct 


5280 




ctgcgctagt 


cacttcagac 


tgggggcacg 


agatcctaga 


aagaggaaac 


agtaaaagac 


5340 
aatgtaactc agtgcccagg gtgtgttgtg aactataaat gatcaggtgt tcaggagagg 5400

gaggtgagtg ccaacctgag ggtcagggag gggaggcttt aaaggaaatg tgacttgata 5460 

ggcatttgaa gaggcagagg gaagaaagga aggtgtttca gttgaaagat acaaaactga 5520 

gaaggaggct ggcatattcc gggtggggag gagaactagg gtctgggagt gtggatggaa 5580 

tagtggcaga tgacagggct tttaaagcca agcaggggat tttccaactt cgatgtggta 5640

gaaatggggc tgcgtcaggc acagtggctc atgcctgtaa tcccagcatt gggctaggcc 5700

gtagtcgatg gatcattgag gccagagttg agaccggcct ggaccaacat ggtgaaaccc 5760 

tgtgtctact aaaaaatgca aaaaaaaaaa ttagccaggt gtggtggtgc ctgcctgtaa 5820 

tcccagctaa tcaggaggct gagacatgga atcgcttgag cacaggaggc aagtttgacg 5880 

tgagctgaga tcacgtcatt gcacgccagc ctgggcgaca gagcgagatt ctgtcctccc 5940

gccgaaaaaa gaaagaaaat gggaagtcgc taaggacttt gactgggaaa ctcttccctc 6000 

tctctggtat ggttgggtga tgggatcaga aatcccctcc tcacttctct agggctcatc 6060 

ttttgtatct ttggcgtcac agggagactc agggggaccc ctcgtctgtt ccctccaagg 6120 

ccgcatgact ttgactggaa ttgtgagctg gggccgtgga tgtgccctga aggacaagcc 6180 

aggcgtctac acgagagtct cacacttctt accctggatc cgcagtcaca ccaaggaaga 6240

gaatggcctg gccctctgag ggtccccagg gaggaaacgg gcaccacccg ctttcttgct 6300 

ggttgtcatt tttgcagtag agtcatctcc atcagctgta agaagagact gggaagatag 6360 

gctctgcaca gatggatttg cctgtgccac ccaccagggt gaacgacaat agctttaccc 6420 

tcaggcatag gcctgggtgc tggctgccca gacccctctg gccaggatgg aggggtggtc 6480 

ctgactcaac atgttactga ccagcaactt gtctttttct ggactgaagc ctgcaggagt 6540

taaaaagggc agggcatctc ctgtgcatgg gtgaagggag agccagctcc cccgacggtg 6600 

ggcatttgtg aggcccatgg ttgagaaatg aataatttcc caattaggaa gtgtaacagc 6660 

tgaggtctct tgagggagct tagccaatgt gggagcagcg gtttggggag cagagacact 6720 

aacgacttca gggcagggct ctgatattcc atgaatgtat caggaaatat atatgtgtgt 6780 

gtatgtttgc acacttgtgt gtgggctgtg agtgtaagtg tgagtaagag ctggtgtctg 6840

attgttaagt ctaaatattt ccttaaactg tgtggactgt gatgccacac agagtggtct 6900 

ttctggagag gttataggtc actcctgggg cctcttgggt cccccacgtg acagtgcctg 6960 

ggaatgtact tattctgcag catgacctgt gaccagcact gtctcagttt cactttcaca 7020 

tagatgtccc tttcttggcc agttatccct tccttttagc ctagttcatc caatcctcac 7080 

tgggtggggt gaggaccact ccttacactg aatatttata tttcactatt tttatttata 7140

tttttgtaat tttaaataaa agtgatcaat aaaatgtgat ttttctgatg acaaatctcc 7200 

ctggtgcttg tatgggaagg agttggagta cataaaaagg agaaaataac aaaggtgg 7258 






<210> 56 
<211> 852 
<212> DNA 
<213> Homo Sapiens



<400> 56 



cagctgcgct 


ggaggctgag gccgattgct 


tgagcccagg


atttggaggc cagcatgcgc 


60 


aacataatga 


gacccagtct 


ctaaatgcat


gcctctctat 


atattaaaat tctgatgtga 


120 


aaatatttta 


aaatttaata 


catttcaaat 


gtttttaatt gtataataaa caaaatgtaa 


180 


ataataaaat 


aatttaatat 


taaattcaaa aatgaggtag aaacaaagca cagcgatata


240 


aataataaat 


tttcctttac 


atttttgagg cggtcttttg agttttggat ttccttctta


300 


ggtcactgaa 


atgtgctcct 


tggagccagc


ccgcaaatca 


cgcatttaga aaaacataac 


360 


tatacactcc 


taaccctaag 


tattagaagt 


gaaagtaatg 


gaatctcgat gtaaacacaa 


420 


tatcactttt 


ttgtagagct 


attttgagta 


taataaattt 


gaactgtgcc aatgctggga 


480 


gaaaaaattt 


aaaagaagaa 


cggagcgaac


agtagcttcc 


tcgtccgctg actagaaaca 


540 


gtaggacgac 


actctcccga 


ctggaggaga gcgcttgcgc 


tcgcactcag ttggcgcccg 


600 


ccctcctgct 


ttttctctag 


ccgccctttc 


ctctttcttt 


cgcgctctag ccacccggga 


660 


aggcactgcg 


gtagctgggc 


tctgattggc 


tgctttgaaa 


gtctacgggc tacccgattg


720 


gtgaatccgg 


ggccctttag 


cgcggtgagt 


ttgaaactgc 


tcgcacttgg cttcaaagct


780 


ggctcttgga 


aattgagcgg 


agagcgacgc


ggttgttgta gctcgctgcg gccgccgcgg 


840 


aataataagc 


cg








852 




<210> 57 

<211> 2501 

<212> DNA 

<213> Homo Sapiens 


<400> 57 



tcttgtcact 


ccatgcactg 


tgttccgtat 


gctaaatagt 


ttgagaaacc 


caaatgggcc


60 


atgttcgcct 


acatttcatt 


gtcctgtact 


tcctgtcctg 


tactagcaaa 


gcagtcccat 


120 


tggtctttct 


tctcctcatt 


aacaataaag gtaacacttt 


tgatgttgtt 


tcttcagaaa 


180 


accttcattc 


atcaaaactg 


cctcaaagat 


catgtttgtt 


tgattccaga acttcctgta 


240 


attacctgtt 


attgtaacac 


tcatcactgt 


attttactta 


cttgtgtaac 


taattttcca 


300 


tattctgcac


tagacaacaa 


agtcctttaa 


gtcaggtact 


atatctattt 


acatagcatt 


360 


cacatctcct 


acaataaggg 


acattagcag 


ataaacaaca 


catattaaat 


gaataatgaa 


420 


gtttctgaaa 


tactacagtt 


gaaaactata 


ggagctacat 


tatatagaat 


aaacatttac 


480 


tttgctatag


aattcagtgt 


aacccaggca 


ttattttatc 


ctcaagtctt 


aggttggttg 


540 
gagaaagata
tttgaaaatg 
tcgtactttc 
ttgcccagaa 
acagaatcca
aaagccacct 
gggtcctcag 
accagagttg 
ctccctccca 

taactttaca
catcttcctc 
attctgctct 
atagtcagtc 
ctagtcttat 

tggaactgat
actaaatctt 
ctctaaaatg 
gcttatcttc 
aaaagaaaat 

atggtactat
tgtggagctt 
actttcgtca 
ccctgagggg 
tgagggtgca 

aggcacaaac
accaccggaa 
cttggcagcc 
cagcgttttc 
taacaaacat 

ttgaagattt
gatattctgc 
tttatagtat 
gcaaggtaac 



acaaaaagaa acatgattgt 
gcattccccc tccacagtgt 
tgaaaatgaa gaactgttac 
gcgaacagac ttttactatc 
agacctttca agaaaggtct 
ttttatgatt tgttgaaatt 
aggtcagact tggtgtcctt 
gaacccagat gctctttcca 
ccccaaccaa gcagctccag 
acttgagtgg ccttgaatac 
tattgaagcc ctcctattcc 
tatgcctcca ctggaattaa 
cttacattgc tttcttcttc 
ctatttctag atcccccaca 
ttctatgtga agcacatgtg 
ttgaaaagtt gtagtatgcc 
cttagaaaaa gatttatttt 
accatcatga tagcatctgt 
catccatgat cttgttctaa 
gataaagtta tctagaaata 
cagtatttta aatgtatatt 
tactccgtat ttgataagga 
atgggccatc agttgcaaat 
taagttctct agtagggtga 
tttcagagac agcagagcac 
ggaaccatct cactgtgtgt 
ttcctgattt ctgcagctct 
ctatgtctaa atgtgatcct 
cctttttatt cagaaacaga 
agaaaaaact atatataaca 
ttttataatt tataccatgt 
gtcctattga gaaccacggt 
tcagacaatt ccactccttg 




gcagaaacag acaaaccttt 
gttcacagtg tgggcaaatt 
accaaggtga attatttata 
ataagaaccc ttccttggtg 
tggattcttt tcttcaggac 
tctcactcca tcccttttgc 
ggataaagag catgaagcaa 
ctaagcatac aactttccat 
tgcaccactt tctggagcat 
tgttcctatc tggaatgtgc 
tcaatgcctt gctccaactg 
tgtcttagta ccacttgtct 
tgatagacca aactctttaa 
ttactcagaa agttactcca 
ccccttcact ctgttaacat 
ccctaagagc agtaacagtt 
aaattacctc cccaataaaa 
aattaactga aaaaaaataa 
cacctgccac tctagtacta 
aaaaagcata caattgataa 
aaaattaaat tattttaaag 
acaaatagga agtgtgatga 
cgtggaattt cctctgacat 
tgatataaaa agccaccgga 
acaagcttct aggacaagag 
aaacatgact tccaagctgg 
gtgtgaaggt aagcacatct 
tagatagcaa agctattctt 
atataatctt agcagtcaat 
cttaggaaag tataaagttt 
agcatgcata tatttaacgt 
tacctatatt atgtattaat 
tagtatttca t 



ttggaaagca 


600 


cactgctctg 


660 


aattatgtac 


720 


ctctttatct 


780 


actaggacat 


840 


tagtgatcat 


900 


cagtggctga 


960 


tagataacac 


1020 


aaacatacct 


1080 


tgttctcttt 


1140 


cctttggaag 


1200 


attctgctat 


1260 


ggacaagtac 


1320 


taaatgtttg 


1380 


gcattagaaa 


1440 


cctagaaact 


1500 


tgattggctg 


1560 


ttatgccatt 


1620 


tatctgtcac 


1680 


ttcaccaaat 


1740 


atcaaagaaa 


1800 


ctcaggtttg 


1860 


aatgaaaaga 


1920 


gcactccata 


1980 


ccaggaagaa 


2040 


ccgtggctct 


2100 


ttctgaccta 


2160 


gatgctttgg 


2220 


taatgttaaa 


2280 


gatcaatata 


2340 


aaataagtaa 


2400 


attgagttga 


2460 




2501 
<210> 58
<211> 2501 
<212> DNA

<213> Homo Sapiens 



<400> 58 


attaattctg 
gaaaccatat 
ataaatgctc 
tcaacctggg 

acttgatcaa
aatgttgcaa 
ccaaggatcc 
acccttttga 
taaaggacct 

cctagatatt
caacttattt 
cagaagcctc 
gttacagagt 
agctttagaa 

ctagtattta
actaagtgga 
ggtcctgacc 
taaaaggaag 
ctatgtgtgt 

ctggggggaa

atttttaatt 
cagatggatt 
taaccaaatt 
caatttcagt 

tgttgctaaa

ttttaaaaaa 
ttaatctctt 
tagtcgttat 
tctctcccca 
gtcctttatc
tttatagtct 



caaattttaa taaatgcttt 
gagttagcaa agtagaaaat 
catcaatatg cacttgttgt 
aggagcaggt aggacttcag 
acatttgcca agaccactat 
gcttcgaatc atatccaaac 
tattggccat tgtctttgag 
tctattctga agttaagtag 
ctgggttact tgtttataag 
tggtaggtga agtcatacct 
ttccctcctt gtcttttagt 
tccaaccagg tagggacagt 
acaaatgtta tatagtacaa 
ttttcttgta gaatttaaca 
gaattttctt ctccaaatgt 
aaatcatgca ttagatattg 
atgagaggaa ctgatgatcc 
tggctattcc ggtttctttt 
gcaaggctaa gccagttagt 
aaggaacata gagttggtta 
ttttaactgg gtttgcggtt 
ccatacactt gcatttaagt 
attggtaact tttctaaaac 
tacagtcctt tactttctac 
agaactattt ttatgaataa 
tgactcatca aaagaaataa 
ttagaattgc aaacatattt 
ttatccagtt caaaaaccag 
cccccttaac ccacattgga 
tttccagtaa agatagcctt 
tggaagtgtc ttcttttgcc 



attttaagct aaatgctgag 
ataggcatat taatcagtaa 
agtgaggcca ccgaggaggg 
atgtcatcca actcaaagat 
gagttaaatg aatagattag 
tcagaacaac atagcttggt 
cctcaaagga acatattaaa 
tgaatttaca tgatgatgac 
ctagtatttc ctgaatcaat 
atatatcccc acaccctaga 
gggagccaca tcagtatcca 
tatagattcc agacctcagc 
gtttattgta cacatcccat 
gttttttcat gccgtattta 
ataacgttta ttattgcatt 
tagaagtaga tacaacaatg 
aatgggggag atagacctgc 
tgatgggcaa gcattttgca 
taattgggat ttttttaaaa 
ttgtcccctt gcctataata 
aaatctcaca gcccaagaga 
atgcaaaaaa attccaatta 
aaaaaaaaat tgtttccctt 
tcaagaaaat agtttcaaaa 
atataaaact aagaagttat 
ctttttcctt tctcttgtaa 
ccttgatgga gaaaatcaat 
agtagaattt actactctgt 
ttcagaaagc ttcattctgc 
ttggagtcga agatgaggaa 
aggacagaga gaggagcttc 



atgaaaaaat 


60 


atgcagaatg 


120 


tgcaatcctc 


180 


atagtgaggg 


240 


gcatttctcc 


300 


cataatgatc 


360 


actccataat 


420 


acaaacactg 


480 


ttttctgatc 


540 


acagcatctc 


600 


agaggagatc 


660 


tatggccttt 


720 


tgagtctctg 


780 


catattattg 


840 


ttttgtatcc 


900 


aacaagaact 


960 


acgtgtttaa 


1020 


aggccttggg 


1080 


aggcacttca 


1140 


aaaacctatt 


1200 


tttgccactt 


1260 


tccagcaatt 


1320 


gttttggcag 


1380 


agttgatgtt 


1440 


ggtgtccctt 


1500 


gagaaaaaaa 


1560 


tcacatggca 


1620 


ctccattttt 


1680 


aatcagcatt 


1740 


aagcctgtat 


1800 


agcagtgaga 


1860 



gcaactgaag 


gggttaatag 


tggaacttgg 


ctgggtgtct 


gttaaacttt 


tttccctggc 


1920 


tctgccctgg 


gtttcccctt 


gaagggattt 


ccctccgcct 


ctgcaacaag 


accctttata 


1980 


aagcacagac 


tttctatttc 


actccgcggt 


atctgcatcg 


ggcctcactg 


gcttcaggag 


2040 


ctgaataccc 


tcccaggcac 


acacaggtgg 


gacacaaata 


agggttttgg 


aaccactatt 


2100 


ttctcatcac 


gacagcaact 


taaaatgcct 


gggaagatgg 


tcgtgatcct 


tggagcctca 


2160 


aatatacttt 


ggataatgtt 


tgcagcttgt 


aagttatttc 


ccttcatctg 


tttcaaatgt 


2220 


tagcattcaa 


ttttagccct 


ggttttggct 


tcagtcagtt 


ttgcgatagt 


agtgaagtaa 


2280 


agacactagg 


attttaaaca 


gtaggaaaag 


ttaatttagt 


ctaactttta 


atatgcaatt 


2340 


gagttttgct 


atataccatt 


gtactgtcat 


agttagagct 


gaaaattgat 


gtttttggta 


2400 


tctttttttc 


caaaggcaat 


tgagtaattt 


ggattctgtc 


tctagtcggt 


ctgtctcttt 


2460 


agtttcctat 


acttgacaat 


gaggtcaaac 


ttagcaaata 


a 




2501 


<210> 59
<211> 2501
<212> DNA
<213> Homo Sapiens

<400> 59

ataaaaaaag acatgaaatg aatcggggaa aatatttgct acataactaa gaatgaaggc 60

ccttaataaa atctgtaaaa ctatacacac ttttaggaat gaatcaacaa ataatttcta 120

tgaattagaa aaaagtgaca atccaactaa aaaatgaata agggatataa gcaatgtgtt 180

tcacagaaaa aataaaaatt gacaatgaag ttatgaaaaa atgttcagtc tccttagtaa 240

ttgcacaaaa caaactaaaa caatgagaca ttacccctaa gattagtaaa tgttaaagaa 300

aaataataat tggtgagggt gtggggaagt gggcacttac acctatgttt ggaaatataa 360

attggtgcaa ccttataggg agagcaatct cacaacattt tccaaagact tacatgcaca 420

accctatggc agagaaattt attcctcttc caggattttt tttccttcaa aaacagtgat 480

gtggatgaaa aacacatgtt cactactgca cagggtataa cagctgaaaa ctggaaacga 540

taatactcac attcccttca gtaggggaat ggttaaataa attttacaag ccatctggta 600

gataccaggc atgagctaaa agttagggtc cagttagaga tggaaagcac accagtaatt 660

tgaaagggaa aatgtaatat gaagaattat taactagtaa aagaaggcta actgctaaag 720

gtacaagagc actcaagctg tctgcagtca gcaggccccg gctggtgagc aggaagctgc 780

ccgctgggag gctgccaaag ttccctgaag gtgagcacca ctggttctac aagctgctgg 840

cagtcatggc gttaagagca ggaagagaag caccagaacc cggaagagaa atccagtcct 900

ctgctaggcc ttgcaccgtc cctctggcgc cctctactga caaagccagt aaaattgtgc 960

cgctagcaaa ggagatcttt ttatgggatg tagcttggtg tcaccaaaga gaacagagtg 1020

gacttggagc tcagatgcaa cacaatgatt gatactggca cagtatactt accctgcttt 1080

tgtaaacaaa atggtatatg tgatgtctct ctttgtctct ctgtatataa aacaatattt 1140 

gtttctactt attatgtatt tatgtcttta ctctgcatgc caggagctaa gtattttgca 1200 

tgtattaact cattttgttc tcataataac cttcacatgc aggaatcatt atagctactt 1260

tatgaatgag ccgaggaagg cactgagacg ttaagtaact tgcccaaggt cacgcagcta 1320

gtaagtggca gagcaagaat tactatggct ttataagcct aggaaaaagt ctgaaagaat 1380 

caaaatgtta acagcgggga cctcaaggaa gcattgaaga ggccatggga gaagttttca 1440 

ctttgttaaa aaatcagtcc ttcaaataaa taaatacagt gaggcttccc cagaagcaga 1500 

tgtcactatg cttcctgtac agcctgtgga actgtgagcc agttaaacct cttttcttta 1560 

taaattatcc agtcttaggt atttctttat aacagtgcta ggatgagctg atacagtttc 1620

ctacactgta acctaaggca atgctttgca caaagggatg agccagattg cttagtaatt 1680 

aaaacgcaaa tacaaaccac aagcatatcc attcatgaat tggggggctg ctttgtgtgc 1740 

atagataagg tatatttttt aaaaaaatta tttttccaag aagaaaataa accagttaat 1800 

aaacgacaac tcacagtgcc aggaagtgag aaacaagtgt gtgataaacg gtggagaatg 1860 

ggagcactct ccgcagtggg cgggaggaga cgaggagggc gttccctggg gagtggcagt 1920

ggttggagca aaggtttgga ggaggtaagt catgtgctct gagtttttgg tttctgtttc 1980 

accttgtgtc tgagctggtc tgaaggctgg ttgttcagac tgagcttcct gcctgcctgt 2040 

accccgccaa cagcttcaga agaaggtgac tggtggctgc ctgaggaata ccagtgggca 2100 

agagaattag catttctgga gcatctgctg tctgtgagat taagcactat gtatattgct 2160 

ttattcactc cccacagcaa ccttaccaag cagttctttt ccacgtgaaa agatggaggc 2220

tgggtggagc aaaaggaggt atttagagtc ctcagcaagt gagaggcaga gctgggattt 2280

gaatccagat ctgcctgata ctgaagtcta ggctggttcc acctctccgg actgctttcc 2340 

agggagtaga agacagatat tttaccttag ctggctgctt ctagaagtct gaccctgctg 2400 

gctcaaaacg actttagttc cttgcccaga ggctgcgggc tgcgggtcaa gacatcagta 2460 

gaaggagggc ccagccagag aggctgacat gggcttctac t 2501

<210> 60 
<211> 2501 
<212> DNA 

<213> Homo Sapiens

<400> 60 

cgggcaggaa taatcactgc ctcccatccc cttaaacatg ccaagatgct ttatccctag 60 

gatgaggtga cttactccag gtaactccta ttgcctaacc actgaccaat tactctgccc 120

tttagtcttt atgtcattaa atctgcatta agaatttcat ggaataggcc cggcatggtg 180 








gctcatgcct gtaatcccag caccttggga gaccgaggtg ggaggatcac ttgaggtcag 240

cagttcgaga ccagcctgga caacatggcg aaaccccatc tctactaaaa acacaaaata 300

actagccagg tgtggtggtg ggcacctgta atcccagcta tttgggaagc tgaggcagca 360

ggagaatcgc ttgaactggg gaggcagagg ttgcagtgag tcgagatcgt gccagtgcac 420

tccagcctgg gcgacagagc gagactctgt ctcaaaaaaa aaaaaaaaaa aaactcaggg 480

aatggatagc agcattgatg aatattgcgt ctggagagat cagatcactt gtcacttgtt 540

tccaggcaca gggcttacca agaggcagat tccagattta aataattctg taacagcaaa 600

gtccaagcta ttttcactgc tttggagaaa agacccagac ccagagcttg aacctcactt 660

tgcagcaccc cagttctaat cttttaagtt tttttttttt tttttttttt tttctgctgg 720

gcacggtggt tcatgcctat aatcccagca ctttgggaag ccgaggggga aggatcgctt 780

gaggccagga gttcgaaacc agtctgggca acatggcaaa accccatctc tacaaaaaat 840

acaaaaatta ggccagagtg gtggcgcgca cctgtagttc cagctacgtg agaggcggag 900

gtgggagaat cgcttgaacc cgggaggcag aggttgcaat gagctcagat cccgccactg 960

cactccaggt tgggcgacag agcgataccc tgtgtgaaac tttttttttt ttctccaacg 1020

ggctttccag agaagtgtgt gtatgtgcgt gtgtgtgcgc gagcgtgctt gcttgggctt 1080

aaactttctg tcgggccaca ctttcccaag tctttgcact ggctgtaggg tgggctttat 1140

cctcgggacg tcctcctccc caagtccagc ctgcagctgg aagtcttcac tgatctccat 1200

ctctcctccc tgatctccgt ctctcctccc tgcccgcctc aggactggga ggccgatctc 1260

tctctctcgc cctcccctcc accagccttt tccagatgta tgtctgccaa agacccccca 1320

gtgcagagga tgatgaatga agatcctcga gccagcccgg tgggaaagtt tcgtcgccta 1380

caaaagcgag ggaaagggaa gggaagttgg gggtagggga aaagttagag ctgagaggct 1440

ggggcgcgac gagtctggac accgggcggg gacccaagct ctctccgctc agccaataac 1500

tgtgcctccc ttaggaaggc gtgaggaaat gctccaatca atccctgcac tcctcccttg 1560

gaatttgggc tgtatttttt tatttactgc aaaccccaca atccacccag gggtttcccc 1620

agtgtttgcc tccagcggtc ccggtgccca tttactagtg ctgctccctc tcttccgcaa 1680

gactgcgctc cagtcccagc ctccttctcc gcgggtgcct cccaaaccgt tctatcattc 1740

tcgggttcag ggaggcggaa tcgtgcctgc tctccggttc ctttaagagg cgtcggctcc 1800

acccctctca gagtcgcggt ctgacgcgag atgacagcaa cgagttcggt atgtctatgc 1860

aaataagcgc cctcttgtgg gccaatgggg agcggaggtg ccggaaccac ggaccaatgg 1920

ggcgggggcg ctggggctca ccatataagg agcggcctcg ccataaaagg aaacattgta 1980

tctctttata tggggggaag ggtcggggga tccctccgcc gccagcgcgt ggtcccggcc 2040

ccctccaccc gccgtctcgg ccgcggccag cagcccctgc cccccggggg acgctgacgg 2100

ccgcccggcg cgccgcccta gcagacggac agggggcgct gcgcgcggcc tggggcaacc 2160

cgggccacag gggcaggaaa gtgagggccc aggtcggccc gggcgtgcag gggccccggg 2220

ttcgcagcgg cggccgcggc agcgatagcg gcactagcag cagcgggagt gccgggttga 2280




gccgggaagc cgatggcggc ggctgcggcg gctccgattc ctcgctgact gcccgtccgc 2340

cctcctgcat cgagcgccat gttaccgacc caagctgggg ccgcggcggc tctgggccgg 2400

ggctcggccc tggggggcag cctgaaccgg accccgacgg ggcggccggg cggcggcggc 2460

gggacacgcg gggctaacgg gggccgggtc cccgggaatg g 2501

<210> 61
<211> 2501
<212> DNA
<213> Homo Sapiens

<400> 61

ggaaccctct gatagagagg gctgactgta tttattgaaa acaaaacaaa acaaaacaag 60

ggttgtattg gtggacccat gcagctcaaa cccttgttgt tcccaggtca actgtatatc 120

cagagcttat aggaaaatac ctctcccagt aaccctgctc accatttctc tcttaagcta 180

ttattatgat tagccacggt ttgctattta aatttaaatt taaataaaaa tgtggccttt 240

cagttatgct agccacattt aaagtgctca atagccatat gtggctaatg gttactattt 300

cggacagcac atatttagaa cattcccatc atttcagaaa ttttcattgg gaacactctg 360

cggaaaaagg gggccatcat aatgtgagtc catcttctgg aaaaatcctg ggaaggggac 420

aaaggaggtc tgtttggcat tgtgtaatgg taatttggta tttaattttc aaaaatgttt 480

acccaattcc tattcatcag ccaggtgtgg tggctcttgc ctgtaatccc agcactctgg 540

gaggccgagg tgggaggact gctgcagccc aggagtttga gaccagcctg ggtaataata 600

gggagatcct gtttctacaa aacaccaaaa acaaaacaac aactttgatg ttgtggagtc 660

aggacagtcc tgggttaaaa cctttgctct ccttagctgt gtaaaccgtg ggtctcagct 720

ttcttatctg ttaacggtag gtacttcttc ctagggctgt tttgaggatt aagtgaaagt 780

ccaagattgt gtctggcaca cagtagcttc tcagcaaatg ttttcctcct atgtcaggga 840

atggctcctt tatcccgttt tgggcccatg ggtggccctg aagggtgggt gctcaggtgt 900

taagttctgt agatggcata tccttgggaa aagcaaggca attaaaaaca gtgagaggtt 960

gctctggtta agttttctcc tataactttc cccatgggtc aattgggtag aatctgccat 1020

tttcctaata cttactgatg gtagtggcat tcggaagcac aatagctgaa gccggagctc 1080

tgagtggaga gaaaggtctg tttctcaggc ccaaaaagag gttacacacc catggctgtc 1140

cagtttggtg gtgcaggccc tgaaatcaga ccaaactgga tttaaatccc caaacctata 1200

ctctaagcta tgtgaccttg ggctagatac ttcacctctc tggccttatg aagtaggaat 1260

aataataata ccgtctaggt tgttaggagt attaaatgag gtaaagcact gaaaacgttt 1320

agggactgtg ttaaatcatt aaataaataa aaacggggat gaccttatcg gcttgacaca 1380

ggggattaaa tgagataata tatgaagaca agtacacggc aaatgcttaa ttaatgttgc 1440

ttatttttat gtctgcaaac tgacttaaag gggaggcctt taagaaagac agtggggcaa 1500

tttgcgcgtt gatgcattgt aggagaaaat gtgcaggggg cccgttggga ccagagttca 1560

accaggtaag cggcagaaaa ccacaaatac ctccaggcgt tcctggggca gcgccgcctc 1620

cccaaaatca cgcaaaactt ggtttgctaa gaattgtcag ctcttctaaa ggaggcgctt 1680

cacgcatctc agtctgtgaa atgggaccca ggacccaggt agaggtgcgt tctcggcctg 1740

gggaccgagt attttgtgcg ctccggtaac gcaggaagac agcgccactg acactctaga 1800

gaccagcggg caccgcctgg aggcgccttc accacttggc ggttccgggt ccgcgcccca 1860

ccgcgccaca agactcacgc ccgaaccacg tgatcagggc cgtggctccg ccccgctccc 1920

gcgccgcgcg ccgcttccgg taggggcgga aagcggaagt gtgggagggt ctgcggggcg 1980

ggctcaggag gtccgcggga ggatggagca gtgagcgggt ctgggcggct gctggcagcg 2040

ccatggagac ggtacagctg aggaacccgc cgcgccggtg aggggccact ggctaagagg 2100

acgggcatgg ggtcagggga agaaaaggcg ggaactggtt gaggggatac acctgtgtgg 2160

gagtccccgg agctaagcga cccagccgat ggggcacctg ctgagtgagg ggggggacgt 2220

ctggtgggtg agggtccggc tgaggggagc atctgctaag gaggttagac ttgggaccgg 2280

ttagagggag cactcgctgt ggtgagactg tgctgaggaa cgtggggaca agttagggag 2340

agtacctgct gaggccgggc cactcggggg aacgctatcc aagcagggac tcacggaggt 2400

gggggcgaat gctgaagcag ggtgagaatc tgtgagggat ctctttaagg gggtggatcg 2460

agaactggcc aagaggaagg ccgggtggac tttctaaggg t 2501



<210> 62
<211> 2501
<212> DNA
<213> Homo Sapiens

<400> 62

gcatggtggc tcacgcctgt aatcccagca ttttgggagg ccaaggcagg cagatcacga 60

ggtcaggaga tcgagaccat cctggcgaac acggtgaaac cccgtctcta ctaaaaatac 120

aaaaaattag ccgggcatgg tggcgggcgc ctatagtccc agctactcgg gaggctgagg 180

caggagaatg gcgtgagccc aggaggcaga gcttgcggtg agctgagatg atcgggccac 240

tgtactccag cctgggcaac agagtgaggc tccgtctcaa aaaaaaaaaa aattactaca 300

tgatactaag taatgcggaa ggtgactcaa agggggaaag gaacacagca gtgtaaagga 360

aggaggttgt agatggatct agaatttccc cctcatttcc atcaggtgaa agcctgagaa 420

aactgcaatc tttgtgcagg ctgggtttgc tttgtacaca ctggtcccct agtgttcatc 480

tccaataatg ctgacaactc tgaaaaccat ctgtagacat tctgcaggct ccatctcagg 540

aacaatggct attttttcgg gtagttgaag caaaattaag tccaatgata agcaaatata 600

accattatca aaatcttcca tttatgtttg ttaaagcaac ctaagtatga tctgagaagg 660

actctgtatt ctatatttga gtccttgtgg atgaactgta acctagctta ataggcagac 720

aagattgaaa acctaattta ggagtatgtg cctttaacaa tagctgagtc ttggccaatc 780 

ccagtggcca tacttcaacc attcatacac tgctgagtgt tcaaactgtg ttcaaagaag 840 

gcaaaagcca acctgtaacc aatccagttg tttctctgcc ttacctccaa tttctgtatg 900 

tcacttccct ttttttgtct ataaatatgt tctgaccatg aggcatccct ggagtctctg 960

aatccgctgt gattctggaa gctgccccat tcgcaaatca ttcattactc aattaaactg 1020

ctttaaattt aattctgctg aagttttctt ttaacaggtt tagaaaaaat aatggcaaaa 1080 

atgaatgaaa atccaataac cctggaagca gaaaaggctg ggggctccaa taagtgtaaa 1140 

tagtcccatc cctatatttt ctccatggca attacaatcc agcacattat atatatattt 1200 

ttttgcttct cgcattttgg cttagggtaa agctttttaa aacaggcact gccaaccagt 1260

gttatcaaga aggtctggat gccgttttgt gggaacattt taaagaggaa tgtccaaaag 1320 

gaaaaggggg atgggttggg agaagggtat caggcgggta tctcaaaacc attcttaggg 1380 

ctataggttt aatttatttg gttgtggacg tcagagccgt catggtaaga aggaagcaaa 1440 

gccttttgta ataattaaag ccttcagaag cagcgtgccc cattgcccac tagtgcgccg 1500 

tgaagtctgg tgttcaccta cagggtccct ctcagcactg cccaggcctc ccgagtgctc 1560

cagcacagta gcttggagct tgttggtttg gtgaccaaga tacactccag ggaatatgcc 1620 

atgcagtgga gtctcttccc cggcactgca tagcaaaagg aaagggccgc tgggtgtctg 1680 

tgggtcctgg gcagtcacag aagccaccgc gctggcgggg aggaggggga ccgatgcggt 1740 

ccatgtcccg ggcagcccca ctttctctgc ctgcgaaggg cccttgtccg gcgggaggag 1800 

agaggcgcgc cccacccggg ctcctctaca cctgccgccg cctgggccga ttccgcgggc 1860

ctcgcccggc gcttcagccg attcccgccc agctccgggc tcatgggcgc ggtcagcagg 1920 

gcgggccagg gcggcggggc gcgacactgg gaggaagtgc gggccgcctg cccgggcgcg 1980 

ttaaggaagt tgcccaaaat gaggaagagc cgcgggcccg gcggctgagg ccaccccggc 2040 

ggcggctgga gagcgaggag gagcgggtgg ccccgcgctg cgcccgccct cgcctcacct 2100 

ggcgcaggta ggtgtggccg cgtcccctac ccggccggga ctttctggta aggagaggag 2160

gttacgggga acgacgcgct gctttcatgc cctttcttgt tctaccttca tcggccgagg 2220 

taaaagtgct gaaaccatgt gaataaaata caggtgggtt ccgccagctt cgctcctgaa 2280 

cctacccgcg ctcgggatcc agaagctgcg ccgggagaga ggggctcagg cctgggcgga 2340 

ggggacggag gtcagaccgt gcggaaagtg acccgggcac cccagggcgc ccaggccccc 2400 

agggagcgcg gaaagtgcgg tcgcggcccg gccctcggga gacgcgggat tgggatcagg 2460

cacagcgcga ggaagtcgat cttggagcta gaacattttc c 2501 

<210> 63
<211> 2501
<212> DNA
<213> Homo Sapiens

<400> 63

cccaaaagat acaaaggggt ataaggtgaa aaattattct aacccatccc tcagtgacct 60 

agttcccttc ctctgaggtg accaatttct tgtgtatctt tcctgagata atctatacat 120

atagcaccat atacaagcaa atgaaatatg ttttatttat ttttttgaga ctgggtctca 180 

ctctatcacc caggctggag tgcagtgaca ccatcttggc tctccgcaac ctctgcctcc 240 

tgggctcagg tgatcctccc accttaacct ccagagtagc tgggactaca cgctcacacc 300 

accacaccca cctaattttt gtttttttgt agagacgggg tttcaccatg ttgcccaggc 360 

tggtctcaaa ctcctgagtt caagtgatct gcccacctcg gcctcccaaa gtgctgagat 420

tacaggcgtg agcctccacg cccggcccca aaatctgttt taaaagcaga catttcttgg 480 

tgattctaat aaagggggtt ctcagacata tttggaaaaa tatatcccta cttttatgcc 540 

agaccctgtg ctgggtcccc gggctgtgtg acctgacact gcacagtcct gcttagaatg 600 

cttaaagaga gttaataagg taccaccttc tatgccatag gcggggagca aaggggctcc 660 

agtgggccct gcctaggagg cctgaagcta gagctgctga gggcagggct gtgctgcaaa 720

gaaaatgtct gagagctgca ggcgtttcat cttctgtcat cagctgtggc acctggcaga 780 

cactggatag gcttgtagac aaagacctgg taactcaagg agctgcttgg ccttcctgcc 840 

cagtcccatc ccagaggcac tgtacatctc tggtttcttc agggggccct gtgtggaagt 900 

atcttttgtc ttcctggtgt cagggatatc atcacgtgcc tgttggctag gcgagcccgg 960 

cgcccagtct cctaggatgg ggagagtaat gttcccgagc agaacagggt ggggctttca 1020

gactactccc tttcctttac agctggcttc attccatcga cctcatcaaa gccttcctgg 1080 

gagcacccta gagaagagtt acgtccaggc cgggccctgg ctgcctggtt cacggcggaa 1140 

tccccagcac cacgcctcgc acgtcgggct caaagcatgt ttagtgaagg agtaggtacc 1200 

tactgctaga tggagccatc tctctagact tggggtttcc ctataacgat ggctatgttt 1260 

ggcatggaag cctctttaga agtcaatagt aggaaataag ggctaacagc acctaattgt 1320

ggagtaaggt tcaaatccta gctctgccac ttaaccgttc cgaacctgtt ccctcactgc 1380

agaggcgaaa aggctaacac tatttcacct cggagggtta ccgtggagaa tggaagctgg 1440 

acaagctgta tcagttcagt agtaaaacac acacacacaa gcgccccacc cccaccccac 1500 

cccaccccag gaatgaacac acacacccgc gcgcgcacat acacctcagg aatgaacaca 1560 

cgcgcgtaca cacacacgca gcccccccca ggagtgaaca cacacacaca cgccccgttc 1620

tgttgttccc aggaacacac acagagacgc acacactcgc ccggttttgt tttttccagg 1680 

ctttttaact ggggtctttc actcggctta gggcaccgct gcctgaaaga cctttctagg 1740 

ccagtcgggg tccggcaccc agttgacgag acagcgcggc gctttcagag ctggggagag 1800 

gcgaaaactc ttccggcccc ccgatccccc ggccagccgc ccccggcagc tccttgccgc 1860 

ctcccggcct gggcccgccc agccgttctc ggcctgccgt caggcgatct cggcggccag 1920

cccagccgcg atgtgacgcc gcgcgccccg gggtcctcgg cgcctgcgcc ctctcctata 1980

aagcagacgc cgcgccgcgc tgcgacgctg tagtggcttc gtcttcggtt tttctcttcc 2040

ttcgctaacg cctcccggct ctcgtcagcc tcccgccggc cgtctcctta acaccgaaca 2100

ccgtgagtag ccgcccactg aactggaaag ggtcgtggct accggattgc gtgccggctg 2160

gcctcaccgc tgcggtttgg gcctgcccgc crcr c ocrcf c era t gactgggcct ggccttcttt 2220

cgggcccggt ggatcgcgtt gtcgaccctg ttcttcggga gacactacca ggttccgttc 2280

acctgccccg cccccgactc agcgaggcct cctctggccg ggcgtcctca cggcgctcca 2340

taagtgagcc gaaccccggg ctgggccttc tctgcaccgg ccgagcgtca gccggcgcgg 2400

agctcggctg caaggcccag gctgcggccg ggggcctctc ttggtcttaa gcctgctgtc 2460

ccggggacca gggcgggggt ggcggcgggg ttgtgaatgg g 2501

<210> 64
<211> 2501
<212> DNA
<213> Homo Sapiens

<400> 64

gatctgacag gttaaaggtg tacacttatt ttctctgtaa gaagcgtcat ctggtaagat 60

gatcaagaat ggtgcaaagc aggatgggga gtttaaaatt gtttccaaat gtgggaatgt 120

aaatgaatat aaacatgtaa gattttaata taccaaactg atcagattct gtgtaatttc 180

caagtttctt ttttctttca aaactcctct gaaatctgac tgtccacaaa aacttacttt 240

atagaatttt atgtgattta tttactcaga tattatactg acctcacatc cagtagtgaa 300

aacagatttt attgtagaat ctggaaagat agagggccat ataggttgta ttttcagttt 360

tgtttatact aacacgtgtt tacaacccag tttaatttac accctgtatt gtattattgt 420

tgtcatatct ctgtatgcat gtaagtataa tatgtgttgg caaaggaaaa ttttgagtaa 480

gaagaagctc tctgatctat ttgattcaat atgtatttga gtgtctaaca gacactgttt 540

tagacactgg tgatacaaca ctgaacggag caccaaatac tttacagcgt ctcctggagc 600

tgttgtcaag acatactttc caaggggaat atttcagaat aggtgataac tagtcaacga 660

aggaaaagta ccttagtcat ctaggagagt tgtacttaga gtgaactgaa ataaactaag 720

ctcacgaaag acagagattt tttgtttggc ttttgtctgt tgcattcact actgtatctc 780

cagggcccaa aatagtgctc ggctcataat aagtattcag caaatatatg ttgttgattg 840

gagtgtttgt tttgaatttc tgtaatcaaa cacatacctt ggtaaattat ctttacatct 900

tgctagttga aaattttatc tcagttgctt tgtttttaat gttaccttgc tttttgtttc 960

tacttgtgcc atacatcagg atgctggaaa agcttattaa tattgacagt catatggtta 1020

tctgatattg aaaagaatag atttggaaag gaacctaaga ggtcatcttt tgttcagctt 1080

cctgcctagg aaaactaagt aagatgatta ggtatgtata tttaattagt catttaaaaa 1140

aaaaccagga caacataatt gagttccctc ttgagaaaat ggagaaaggt acttaaccct 1200

agctataaag ggactaacct ggaaatttta gaacttctgt gtgggaaagt ggaaaaaaaa 1260

aaaaagcaca actaagctgc tctttgttga tatcagaaat gggcctgtca ttcattttgg 1320

cattgaagca tagcctccta tctcggggca ggactgggac atttttttcc tcccacaaga 1380

gctggacagt tattacaggt tcaaaaagcc ccgaccagtt tttcaagagt ttctcctcct 1440

cttttccccc tgaaactcgt ggtgcttttg ctctgctttc aagatgcatt aagtctcctg 1500

ctttgtgact gctttggagc cagcagatac tctgatatgt ataattcaaa ttatgcaggt 1560

ttcacgagta agtttaatct tattttttaa gttagttaaa aggcaagtga tatttagaaa 1620

aatgttaact tgtagttatt tcaccctttt tactttaagc atttttattg cttctcggcc 1680

ttttggctaa gatcaagtgt gtactttaag cattttttaa aataaaaata tccttttaat 1740

ttaataagaa aacaaggttc tacatagaaa agccccttca tctaagacct gcacttttca 1800

atttcttttg agatgtcttt gttgtaaaca gtattcatat gtcttttgaa agccagttaa 1860

ctaaacagtt ttcttgagca tctttttagt tttactgaga agtattttaa attgagcttt 1920

tctgagctcg attgcttacg tctgacacag tctcaagttt ccactgaatg gtaacaaaga 1980

ctgtagaatg ttgttggtac tgcagtgaga ggcatgcttc cttagaccag gtaagagaga 2040

tcagtttgtt tctcactgct gggtgagttt ttacagctct tattttatat tctttaagca 2100

gcagcaatat taaattgata aatagccagg agcacgctga tttcaagacg tccttgcttg 2160

ttgcagacag aaaaactaca gggttatgta tgggggttgg ggtggggggg gaggggaaga 2220

attagtttat tactcagtta cttatataaa ttaattaaaa tgtgaaaata attctggagc 2280

tcagttttct taattcagga actaaagcag cagttgagga aatcagtaat tttaaaggta 2340

cttcatggtt attacttgtg aaagcaattc aaaggatagt ttttactttc atttttttcc 2400

ccagtagtta ataaaataag ctttgccctt aactaaacat tttttccact tacgaaaact 2460

tttaaattgc caacagcaaa atatacttcc caaggatcct t 2501


<210> 65
<211> 2501
<212> DNA
<213> Homo Sapiens

<400> 65

cacaagtcaa gaccgctccc tgcttcttag cccgctgggg agccaggcca gcaggcccca 60

cattcctgag gaagggacag ggttctggcc tggagggtct agcagaagcc accccagggg 120

agggcccgac aggaaggaag gtaggcctgc cggaggggca tacaggagct tcctctcccg 180

ccacagtgtc cagggccaac tgctccagcc ctcaggctgg gtcaacagga tgggacagcc 240

caggcggaag gaaacctgtg gggagggaca ccccgcagac agaagcaggg acatggggtg 300

gggagaggca ggaagagctg ccgggctgct gagctggcgc ctctccagca gactcaggag 360

gggcggtgac aggaggccat tccctcctca tccccgcagc cctgggcctc tctggtcctg 420

gccaacagta ttactatcat tattattgct gttgttcgct agcctgggcc ttagatacat 480

tagaaaaaaa ccatcggaag atacgcatag cattggcagt ttctaaaaga attaattccc 540

ttcctgtgtt cattctgtga ttactgggat agaaatgcta tttgcattac cagcctttca 600

ttcagttaca gagacgtgag tgctcgaagg agagacagtg atttttgcct taaattcagc 660

ctgtccaaat cggataagat ctccgatttg ctttaagccc cgttatcact gccttcctct 720

ccaacaacag ctgctgtgat cacgcacaaa cggccaaacg ggggcaaatc cgtgccaaag 780

cagggccatg ggctttcctg atcagaaggc ctagccccag cccccaggcg cagcacacgg 840

gcggcttcct ttcagaaacc cagcctgcct cccaccagct ggagtgggtg ggtggggcgg 900

tagtggtgcc agtttcaggg aacggccggc aaacccacct ccaggcgtgc tccagcggga 960

gcctggagac cctaggagag ccctccccac aagcggcttc caggcaggac gcttccagag 1020

gtcttggtcc aggggtgggg gtgaggtggg gtctaccttt gaaacagcta caatttaaac 1080

ttcagctaca ccgagctcaa actcgattcc gcagccgagt gtcggcgcca gagaaggata 1140

aaaactcggg tctacggctc cccaccacgc ccctggtccg gtcctctggg cttccaggag 1200

tcctcacgcc atcctctggg ttgcccagga ggaaggatgg gcggggcggg caggcgctgc 1260

gggcgctgca gatggggagg gcgagcccgc ggcacggcgt gagcggggga gaggcgcgcg 1320

agcaggtgtc ggctccgtga cagggtcccc catcccgcgc cccagtgctc cccgaggctt 1380

agtgaggcaa aacccagcaa atgcttcaga aatgcagctc agtcggtcac cgggttctgc 1440

ttcctcatca gacgcgcaag aggatggcgc ttccaatgca aatctcttgg ctccggcccc 1500

ttggctggca gccgccgcgt cccccgcctg cctggcgtcc cgcccactcc gtggcgggct 1560

gagacgaggc ccggcgcgga ggggacgggg cggagcgggc atccctcccc accccccacg 1620

tggggctggc cctccgcagt gcctgggcgc gctgcagtcg ccgcgcctcc ccggccgcgg 1680

caccgcctct ctaggcaggg gcgggggacg aggggcaagg agtgggcgag gggtgggcga 1740

ggggcggggg gcgtcactca atcaggtggc ctctggagtt ccccggggca gggcagaggg 1800

aacacgctgc cggggattgt gtacacgctc cactgacacc agcttcacgc tgccgggcag 1860

tcgccgatca cgcgtggccc cgcgagccca ttggccggcg cctcacacac ctttgccgtt 1920

gattggccgg cctcaggctc cgcccccacc cccgcccgcg gcgcggggca ggctgagcgg 1980

ctacctgaat ggggaggggg cagacggcgc tgagcgcggc ggcggcggga gcggcgtcga 2040

gtgtctccgt gcgcccgtct gtggccaagc agccagcagc ctagcagcca gtcagcttgc 2100

cgccggcggc caagcagcca accatgctca acttcggtgc ctctctccag cagactgcgg 2160

taagtcattt ggggatgccc ctgtgcttcc tcgcctggtc ttgtctgggg ggccaaaggg 2220

ggcgcgaacc ccgagccccg gacatcagcc atgcctgaga attggggctg cagcggagtc 2280

gtggggaagg aaagggcttc ctgcctgcag actatgggca ttagtgaggg cgtgtgtgtt 2340

ggggaggggg tcgaaccagg gggctgggat cttcagacag ggacaggggt cttgctctag 2400

atgtactgag gggaagggac aactccgcat ggagacccga gagggctggt gaggaggagg 2460

atgacgagcg ggggaggagt ggggaggggg ccgttgccct g 2501

<210> 66 
<211> 2501 
<212> DNA

<213> Homo Sapiens 

<400> 66 

ggggctgtag aaatggcggc cccatctccc aacaacttgg gcattgtgaa tatcacctcc 60

ttaaagggga tctcctttgg tcatcccgtc tagagcagcc accataactt ctgagcgttt 120 

attgctagct gatatatatc agaaaaatac aaattccaca aaagcaggga ctggtctgct 180 

tctctccctg cagggcccag gttctggcac atagttggtg cagaaagtgt gcagcctcag 240 

gtcctatcca agcccccagg gcatcacact cgggacttgt tctgcatatt tttacttttg 300 

cctcccactg gtactagttc ttccgtggaa cagcctgagt cccttcagat acttaatgtt 360

ttttctcaag tgctgccatg aagccagatc tccaccgtct tggggcattc ctttttaggg 420 

atgggaagta tatgtcgctc cttttatgtg atttacattc tatcttggat aatttggcca 480 

tcaccgtagt tcattcagat ctgtttggat cctgcccatc tcagcttcag tccatttcat 540 

tcttttaaat ctgatcgaca gttacctcca acagcttcat cacaaatcac tcacaaaaat 600 

ggccttaatc ctgaagttta tttacggaga gcacacttgc taggtgtgtg gcagatatac 660

aggaagcaca agatgaggca gcagatctag aggcaaatga cttccttctc cctgcctagt 720

ggtgactgcc agcatcacgc cctcccggga gaggtgagaa acccctccac gcaagcactg 780 

gaaccttcac agtcaagagt ggcaacagct ccggttactg gacttgggcc tgttgaattc 840 

taatactctg tgactccaca tctgggctga atttttgctg agtatgatgg aatttacatg 900 

cttcctccct agcccctact tgtctgtata gttggaatat ttggttgcct cctctggagg 960

gatctagtac gtttagagtc tagacgctgg aactgtcaaa gttcagagga aagagctcca 1020 

gctgcaaagc aagagaaatg ggctggaatt ctagcttcac cccttaatga atgcttctga 1080 

tttttttttt tttttttttt ttgagacgta gtctcactct atcgcccagg ctggattgca 1140 

gtggccacga tctcagctca ctgcaacctc cgcctcccag actcaagcga ttctcgtgcc 1200 

tgagcctcct gagtagctgg gattacaggc gtgcgctacc acgcccggct aatttttgta 1260

tttttagtag agacagtttt tggccatgtt ggtcaggctg gtcttgaact catgacctca 1320 

agtgatctac cttcctcggc ctccgaaagt gctgggatta caggcccgag ccaccgcgcc 1380 

cagccgcttc tgatcattaa aaaaaaattt tttttttggc ggggggaacg aagtgtccct 1440 

ctgttgctca ggctggagtg cagtgcagtg atctcggctc actgcaatct ctgcctccca 1500 

ggttcaagcg attttcctgc ctcagcctcc tgagtagctg ggaatacggg tgccccccac 1560

cacacccagc taatttttgc atttttagta gcgatggggt ttcgccatgt tggccaaggc 1620 
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tggtctcgaa 


cttctggcct 


caggtgatct 




tacaggcgtg 


agccaccgtg 


cctggccaaa 




gcagtagtga 


gaagggggga 


aagagtagag 




tttgaacaag 


ttacctaatt 


ctctgaggac 


5 


atttgcaggg 


ttgttgggag 


gaataagtga 




cagaaggtgt 


tcaattaatc 


tgcaatcatt 




ccatccatcc 


tccgagtgtt 


gccaagttat 




taaggcttct 


ggctgccagc 


ggcgaacctc 




tgaaatgcga 


ccgggtctct 


ttaagggcca 


10 


tggactagca 


gtcgtccgcg 


ccgactcgca 




tcgcccggct 


atgctgctgt 


ggccgctgcg 




tgggccggga 


agtcgcggga 


gcccggcctc 




ggcctggcct 


cccggtaacg 


cgcgtcttgg 




cctactcccg 


gcccctcggc 


ttccggaacc 


15 


gcgggatttc 


ctccccggct 


gcggctggga 
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20 


<213> Homo 


Sapiens 
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atggtctcga 


tttcctgacc 


tcatgatccg 


25 


acaggcgtga 


gccactgtgc 


ccggcctcta 




tttttttttt 


gagacagagt 


ttagctcttg 




tcggctcact 


gcaacttctg 


cctcccaagt 




atagctggga 


ttacaggtgc 


ccaccaccat 




acagggtttc 


accatgttgg 


ccagtctggt 


30 


gcctccacct 


cccaaagtgc 


tgggattaca 




cagcattttc 


atactgaatg 


tccacagctg 




ctgactcaga 


gagaagcagc 


cacatgtccc 




gagctgggtg 


tcttagctca 


catctgtaat 




attgcttgag 


gccagaagtt 


caagaccagc 


35. 


aaaaatttaa 


aaattagcca 


caaaatttaa 




gctcacgcct 


gtaatcccag 


cgctttggga 



113 



gccttccttg 


gcctcccaaa 


gtgctgggat 


1680 


aaatttatgt 


tttaaaaaga 


ctagtcaagt 


1740 


caaggagtta 


tatctgttgc 


ttctgaccat 


1800 


aagctcggag 


aatgggagag 


acagttatct 


1860 


catcatgagt 


gtgtgccagg 


tgtctgatta 


1920 


aattaaccct 


tcagtcgctg 


gtattatttg 


1980 


gggtgcgttc 


tgccagcgtc 


ctagcagtgg 


2040 


tcccttcgag 


tatttctcct 


cttgctgaga 


2100 


ggcgccggga 


tccaggcggc 


gcccaacggc 


2160 


caagaaggaa 


v*« y X#» ^ 


ctcr crate cere 


2220 


gggctgggcc 


gcccgggcgc 


tgegctgett 


2280 


aggccccggg 


ccgcggaggg 


tgcagcgccg 


2340 


tcccgcctcc 


caggagcccc 


tatgcgccca 


2400 


cgcccgagcc 


cgaagcgcct 


cttccgaggc 


2460 


cgggggcggc 


c 




2501 



cccacctcgg 


cctcccaaag 


tgctgggatt • 


60 


tcagcatttt 


ctttcttttt 


ctttttcttt 


120 


ttgcccaggc 


tgaagggcaa 


tggtgtgatc 


180 


teaagegatt 


ctcctgcctc 


agcctcctga 


240 


gcccagctaa 


tttttgeatt 


tttagtagag 


300 


cttgaactcc 


tgacctcagg 


tgatccgccc 


360 


ggtgtgaaag 


agaccattcc 


cgatctcttt 


420 


ccctgtgagg 


aggcttttta 


cccatatttt 


480 


ttggccatgg 


cagttaagac 


caactccatg 


540 


cccagcactt 


tggaaageca 


aggcaggatg 


600 


ctgggcaaca 


tagccagacc 


ccatctctac 


660 


aaattaacaa 


caaaagggee 


gggtgcggtg 


720 


gggtggatca 


cgaggtcagg 


agttcgagac 


780 
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35 



cagcctggcc 


aagatggtga 


aatcccatct 


ctactaaaaa 


tacaaaaatt 


agccgggcgt 


840 


ggtggcgggc 


gcctgttgtc 


ccagctaccc 


aggaggctga 


ggcaggagaa 


tcgcttgaat 


900 


ccgggagtct 


gaggttgcag 


tgagccgaga 


tcgcagcatt 


gcactccagc 


ctgggcgaca 


960 


agagcgaaac 


tccatcttaa 


aaaaaaaaaa 


aaaaaaaagt 


ggaagatgag 


gaagttgatc 


1020 


agacatcaag 


gatgagcgga 


tgacttaata 


ggcttctttg 


ctaagacttg 


gctgggcagg 


1080 


tgaaagacaa 


agtcgaggag 


tggttatggt 


gtggcacaga 


agaagggtca 


gaggacggtc 


1140 


tttgttacct 


cttcatgcct 


gagtttcttc 


ctctgtgaaa 


tggggataat 


aagagccgcc 


1200 


atacagggaa 


ttgctgctag 


gatcaaatga 


gataatgtat 


gtgaaacgct 


ctggctgtag 


1260 


gcttctcagc 


aaatgggcac 


gacttgcgga 


gtggggattt 


gaattcacgt 


ctggcgggat 


1320 


gtccaagctg 


ctaccctgac 


cgctagggag 


cttcagagga 


cagggctgca ggtgatcagg 


1380 


aagaggactg 


gggcaggtgg 


gcgaggaatg 


cctcccagga 


gtgaaggagg gggaattcta 


1440 


gtcagcagga 


tggagtcggc 


caggtagaaa 


cgagggaaag 


gagacaggac 


cggatggaac 


1500 


ggggaagcca 


aagggcaggg 


cgtcggaggg 


ttgaatggtg 


gccggtgcag ctttgaacac 


1560 


cgaggtgagg 


acatgcagct 


gtgtcctagg 


gtcaggaccg 


tacacgcctg 


acccaattcc 


1620 


acagcacgga 


ggggaactcc 


aggatccggc 


cgcgttgccc 


acacacttcg ctctccctcc 


1680 


cgcctctcgc 


aagcccctcc 


cccgtctccg 


tccaccgagt 


gccagccaat 


agcagaagcg 


1740 


acagcgcatc 


tgggtgccga 


ctcagccaat 


cgcggctgag 


tgacgaatga 


gccccaggac 


1800 


caatgagagt 


gccgccacca 


tggcaaaaaa 


aaaaaaatcc 


aatggtgacg 


agcagggaga 


1860 


acagagcagc 


tgccaatggg 


cgtgtgcgtt 


tcaggcggcc 


aatgggagga 


ggcgtctcgg 


1920 


cgggggacaa 


gcagtagcta 


cccgcgggag 


cggggagggg 


tccgggttcg 


agcttgtgtt 


1980 


cccccggaag 


ggtgagtctg 


gacgcgggcg 


cggaaggagc 


gcggccggag 


gtcctcagga 


2040 


agaagccgcg 


gggactggct 


gcgcttgaca 


ggctgcactt 


ggatgggagc 


acctggtgcc 


2100 


tcgggactgc 


tccgatgccc 


ggtgggtgca 


catcccagtt 


cccgccgttg ccggccgggt 


2160 


ttagaggttt 


tggggggagg 


acatgggagc


gtgcagcctt 


cccagttgca 


aacttcactc 


2220 


cgaccctgtc 


ttcaaagctg 


ggtctgggtc 


cagtggggac 


gagaaaggag 


gaaggaggaa 


2280 


gtaggctccg 


cgaaagcccc 


atccccggga 


tctcatctat 


aacatgaata ggtattaatg 


2340 


gcaaaggcta 


attaagcgct 


tactgtatac 


caggcacttt 


ctctgcctcc 


tcgcgttaaa 


2400 


tcctcccagc 


agccttttga 


ggtagacact 


gttacatgcc 


cattttccag atgaggaaac 


2460 


cagcaacatg 


ggtggaagtg 


acagcccctc 


cacttccata 


c 




2501 
ggagtgcaag


aacacagaac 


taaaacagag 


cttgaaactt 


aaagaaaggg 


agagacttgg 


60 




gggaggagtg 


gggtggagtg 


acgtgatgtg 


ctgctggaaa 


ccagcagttg 


gtggtttcct 


120 




cttgtgcttc 


ctcttctgtg 


ggttttctcc 


tgcttgtggg 


agggcctttt 


tctctcctcc 


180 




cgacagaaag 


gctatctttg 


gtgttcgttc 


ccttgaactg 


taacatcctg 


taagggtatg 


240 




attccatgcc 


tctgtgtggg 


tgtgaattcc 


ctcatggtga 


ccctcaaaat 


ctgcacacag 


300 




gaccccttcc 


cattgagggg aggggatcaa 


aacaactcta 


cttctcaggg 


tcctctcctg 


360 




ttccaactgg 


tctgtgtcca agagaagcct 


taggtaaatg 


gggccagctt 


gaagatcaaa 


420 




caggtttggc 


agcctctccc 


ggcctctctt 


ttctctccta 


cagctttata 


gctacagctg 


480 




ccttgatatc 


aatattgact 


ttggctggct 


ggcatgacta 


cccacagggt 


atcgtgcctt 


540 




aatttaccag 


gtgacaggca 


acgctgccct 


ctcctggaac 


catccagcag 


agccagggct 


600 




gtacccccaa 


atcctgcaac 


agaggtttcc 


ctccatctca 


cctccctgtc 


cctgcatttc 


660 




tcctatctca 


gtagctcctc 


tttccctctc 


tgggcttctc 


tttccactcc 


ctccccttcc 


720 




tgggcttggt 


aaactagtcc 


ctaatctctt 


cacaccccag 


attggaaggt 


gggtccctcc 


780 




ctgacactcc 


ccagagctgt 


caccaacctc 


ctccaagttt 


ctatagctcc 


attgctcaac 


840 




agatttgcca 


ggggtaacca 


ttaacccagc 


ccttaactct 


gttcccccac 


ctttcttgct 


900 




ggaggggatt 


ttccaattac 


tggttagcac 


agctaggtca 


tctcaccccc 


accatctttc 


960 




ctaacttctt 


gggttggggg 


gctggggagg 


aatctcccca 


tctcagggta 


ctaggaacaa 


1020 




agctggggag 


gatggtgcat 


ttaaagggat 


tatatatata 


tatatatata 


ttttttttct 


1080 




ttctccctca 


taaccccacc 


cccgcaacac 


acacacacac 


acacacacac 


acacacacac 


1140 




acacacacac 


agacgcacaa ataagcttta 


tggagcagtg 


acttcattat 


gttcaccgct 


1200 




ttgagtccaa 


cccctggccc 


aaaataggca 


ctaaatagtt 


gccgaatgca 


tgaatgatag 


1260 




atacctctct 


gtcttcaggg gtgtgtagaa gtgcgaaggg 


gtatgggcat 


gtcccagtag 


1320 




gggtgtgagt 


gttctgatca gaactacttc 


tctctgccag 


aatttgatgt 


aattcgaatg 


1380 




cttccacctc 


tgcttgaagg gtttaaataa 


taaattaggc 


cctgtcgtgc 


cattatgggg 


1440 




gtggtcatac 


cctgtaccca ggaaacaggc 


acggtagggc 


tgagacagaa 


gtcctgcttg 


1500 




tttccgctta 


tttatttgaa acaccgctca tttaggtctt 


actttgtttg 


ccaggcactg 


1560 




ttctaagctc 


tgtataaata 


ttaactcaga gggtacaaat 


attaacttaa 


gagttgttgc 


1620 




aggaaaaaaa 


ataagcgcct 


ctggctcttt 


aagtttggcc 


tccccctcaa 


aacccccgca 


1680 




acggtcccaa 


accccttcca gggactggga ctacggaccc 


tggtccgacc 


ttctcgcggg 


1740 




cttcccactg 


cgccaatcaa 


atcccagaaa 


cagtgagtgc 


tagaggcccg 


gctgctaagc 


1800 




aacggcagag 


ggcgggaagt 


ttgaacgttc 


tggacccgcc 


ccgaaggcaa 


ataggccaat 


1860 




cagcgtccag 


actcttcagc 


tacggcagtc 


cgcttctcct 


cctcgccctg 


tcggatctct 


1920 




aggctggatc 


cgggcctctc 


caatcaacag 


cggctaggag 


ggcggggcgc 


gtgcgcgcgc 


1980 




acctcgctca 


cgcgccggcg 


cgctcctttt 


gcaggctcgt 


ggcggtcggt 


cagcggggcg 


2040 




ttctcccacc 


tgtagcgact 


caggttactg 


aaaaggcggg 


aaaacgctgc 


gatggcggca 


2100 
gctgggggag


gaggaagata 


agcgcgtgag 


gctggggtcc 


tggcgcgtgg 


ttggcagagg 


2160 


cagagacata 


agacgtgcac 


gactcgcccc 


acagggccct 


cagacccctt 


ccttccaaag 


2220 


ggtaacctcc 


gcgtgacagg 


aatgagggtg 


gggcgcgtgg 


agtttcccac 


aatctgtact 


2280 


ttagttaaat 


acccgagaat 


tcacctcctg 


tgtccacagc 


tctccacgcc 


cctcagccct 


2340 


gccccgcagc 


cctgtagcag 


aagtacttag 


tgctttgcat 


tctgcgcgcc 


accctacccc 


2400 


ggcctcctct 


gtgaatcgtt 


gcttccgaac 


cgccctcact 


ttttgcatcc 


gcaga 


2455 
Sapiens












<400> 69 














ttttaaacga 


gaagtgatgt 


ttccggagca 


ttaaaactga 


agtgatttca 


aaaccatgtt 


60 


gcactcacac 


gaacaggtgt 


gcacttaatg 


gactaaacta 


gttcagctga 


catgtcttct 


120 


tcattaggaa 


cagtgtggag 


actgaaaaac 


taatttagcc 


tagagcagct 


atttaattgt 


180 


aaagtctcct 


ttctcaaata 


ttgatttact 


atgtgaggaa 


atatttactt 


tgtatagaag 


240 


tgtgtggaat 


tggacgaggg 


ggttgaccta 


cacatgtggt 


ttggtataca 


catatcctca 


300 


ttacagaggg 


tgtaatgaag 


atataggtgg 


ttcagcacca 


taggaaaggg 


aaaaaagaaa 


360 


aaaaaaagac 


ggtagaggtg 


gcctcccaag 


catccactcc 


cactcctctt 


gttaatgatt 


420 


cacaatttgt 


tgttattgtt 


gtcatttact 


gttctccaca 


cctttccaca 


aggcctgtgt 


480 


gctttgaaaa 


aatatgtctc 


tactccggat 


agaagtgggg 


cacacagggc 


caggcgcggt 


540 


ggctcacgcc 


tgtaatccca 


gcactttggg 


aggccgaggc 


aggcagatca 


caaggtcagg 


600 


agttcgagat 


cagcctggcc 


aatatggtga 


aaccccatct 


ctactaaaaa 


tacaaaaatt 


660 


agcctggcgt 


ggtggcacgt 


gcctgtagtc 


ccagatactt 


gggaggctga 


ggcagaagaa 


720 


tcacttgaac 


ccgggaggca 


gaggttgcag 


tgagccgaga 


tggtaccact 


gcactccagc 


780 


ctgggcgaga 


gtgcaatgag 


actccgtctc 


caaaaaaaaa 


aaaaaaaaga 


aaaaaagaaa 


840 


agtaagtggg 


gcacacgatt 


caggcctaag 


ctaaccagac 


caacctcatt 


cctgatggtt 


900 


gttaatgttt 


cagatacggg 


cccgcagccc 


tacgtagaga 


agaggccaag 


gtagaaaaca 


960 


tgaatctgag 


gtaaaaagaa 


atgaggtact 


tgtttgcctc 


atcaagcctc 


tcaattaaac 


1020 


taaccttgaa 


gcctgtctta 


cctttggact 


tctagtgatg 


tcacccggta 


aagcccattt 


1080 


gtttcaggac 


gtaagagttg 


ggttttctgt 


gacttggaac 


caaaaccatt 


ccaatttaca 


1140 


aaatgagcaa 


ctttaatatt 


acccatgaga 


aatacttcat 


tggtatatgc 


tctttcctag 


1200 


cgtttttgaa 


aactaaacta 


ggtgggtgaa 


aagtatatct 


ttgcatgaaa 


ctttttcatt 


1260 


ccagaaaaca 


ttttgtcatc 


ttgataataa 


tggccaatgc 


tactatatcc 


aaatttttgt 


1320 



cttttttttt 


ttttgagaca 


gagtctcgct 


ctgccgctca 


ggtgtgatgg cgcgatctcg 


1380 


gctcactgca 


acctctgcct 


ccctggttca 


agcgattctc 


ctgcctcagc 


ctccctgagt 


1440 


agctgggatt 


acaggcatgc 


gccaccacac 


ctggctaatt 


tttgtatttt 


tactgtagac 


1500 


ggggtttcac 


cattttggcc 


aggctggtct 


cgaactcccg 


acttccagtg 


atcctcctgc 


1560 


ctacctcaaa 


aagcaacttg 


ataaatccac 


aggctcggta 


tattttaaaa 


attcttttaa 


1620 


atacagtata 


cttttctctt 


tttttccaga 


attaaccatg 


aatcgcacac 


acagccagag 


1680 


gcttttaacc 


cgagaacgga 


caaaggggcc 


tgcttgtgca 


atacaattat 


ttttaatggt 


1740 


taaacaaatt 


aatacataag 


accagcttta 


cctaatataa 


taataacgaa 


ccaaagttta 


1800 


caacagacaa 


gaaaagcacc 


agctgtcccc 


gccaccccgg 


agcgatctcc 


aaggggacgc 


1860 


gggagagcgc 


cgcgggggac 


gcggaagtct 


gacgtcacag 


gaactggggg 


cggggcgggg 


1920 


aggcccgcac 


accctattgc 


gcatgctccc 


gcctccccgg 


ccgcggcctg gcgcagtgcg 


1980 


cacgcgcgcg 


ggtgggcggg 


tttgactggc 


cgtagagtct 


gcgcagttgg 


tgaatggcgt 


2040 


tggtggcggg 


aaagttgagt 


ctctcctgcg 


ccgagccttc 


ggggcgatgt 


gtagtgcctt 


2100 


ccatagggct 


gagtctggga 


ccgaggtgag 


agccgccggg 


ttgggagtga 


gggagatggg 


2160 


aacaaggccg 


ccggtgggcg 


aggggagccg 


agggaacccg 


ggggattggg 


aggcttgggg 


2220 


cggcgcggcc 


tggccgggct 


gggaccggcc 


tctcggccta 


gacgcccgcg 


atgctggcac 


2280 


cctctgccac 


ctctcacctg 


ggccccaggg 


gtccgcccct 


gggcagcctg 


gagtcctccg 


2340 


aggtgggagg 


accgggcgga 


ggtggaggaa 


gtctttcttt 


ggaagacttg 


ctgcctgccc 


2400 


agatcgatat 


aacatacgag 


gtctctcctc 


ccaagagtta 


tggtctaaaa 


acccctcaca 


2460 


aattaactac 


cgttggaaat 


gtcaagctat 


gcaagaaaag 


ctagaaaagg 


ggaggggtcg 


2520 


cccgttggag 


catttggagc 


ttttctggaa 


caggtggtgt 


ttgcggaggt 


tgcctcacct 


2580 


ccctgtagcc 


cacgtgtctc 


tgcttagggc 


agctggccct 


cgcca 




2625 
tagtcccagc


tactcgggag 


gctgaggcag 


gagaattgct 


tgaacccagg 


aagcagaggt 


60 


tgcagtgagc 


tgagattatg 


ccactgcact 


ccagcctggg 


caacagaggg 


agactccatc 


120 


tcaaaaaaaa 


aaaaaatcat 


taaaatacag 


taattcaggt 


ttattaagtc 


attaccattg 


180 


ggttacctca 


caaataaact 


aagtttagat 


gcgaactcaa 


agatactgag 


acactaatcc 


240 


atttcttaag 


ctgctaagtt 


agccttcttg 


aaacctcact 


tcgtagctct 


gcaaacaatg 


300 


tacttttgac 


atcccaagct 


cacaggaata 


aaaaaccacc 


tgccagttgt 


ttccgttttc 


360 
cacctatgtc


taatttatgt 


acttatattt 


ataagaaaca 


aatcactaag 


tcttatttca 


420 




tccttagtta 


tgttgtgttt 


ctatcgataa 


cagcatgaag 


atttcgggga 


cctggacatt 


480 




aaaataagtt 


tgagtactgg 


ctttacaatc 


tactaggtgt 


gatccgaggc 


aagtcagtct 


540 




cttcatgttt 


cacttctttc 


acttgtaaac 


atctattcag 


aagttgctgt 


gaacttgata 


600 




tttccatgct 


tataaactga 


ttttttgaaa 


agagcctggt 


acataggacg 


tgataataaa 


660 




tgaaagcatt 


tgctactttt 


ggaaaaacaa 


gcatgacaag 


atagtttata 


tactgttgat 


720 




cttaagcaca 


gtatatgcat 


cttattttta 


gctagtctga 


cagtgagata 


ataaaaagag 


780 




ttatctttga 


cttgcactac 


gagtagaaga 


attcaacttc 


agtttctaga 


aagatgtata 


840 




agaattaaga 


gtggcagtct 


tcctagtctc 


aactgccatc 


ttcccaccag 


gtggtaaatt 


900 




cgtccagaga 


agaaaatgaa 


ttattgctat 


atgggattct 


gcagcaactt 


ctgtgaacat 


960 




aggctcataa 


tttttcacca 


tggagactca 


agctttttgg 


agtcatagtt 


gtttttgggt 


1020 




ctatttgcag 


gcatgcatcc 


tttgtccaga 


aatatacata 


acatttggca 


catggacctg 


1080 




gaggtaaaag 


aggaggaagg 


cctgaggcta 


gacaccactc 


caataagtac 


attaagctcc 


1140 




tagaagggca 


atccaccttt 


gcagagaact 


cttaactatt 


aaaacctata 


gcttgtaaag 


1200 




cagcattttc 


aaagttaaga 


gaagaaggtg 


gaagggtctt 


gagaggctac 


tgactaaaca 


1260 




gatgaaaatg 


aaggtatgga 


gtttggtgcc 


aaaagaaact 


ccccccaaaa 


atcaaacaat 


1320 




aacaccagag 


taaagcccct 


agggcgagat 


aaggagttgc 


aacaaaacaa 


gcggaaactc 


1380 




gagaagcgct 


aatgcttcaa 


agggtcaatg 


accacacata 


atctacgtag 


ccaacgtgtt 


1440 




aaaacacacc 


aacgcatttt 


tttttcctaa 


acaaagtagg 


aaagcggact 


ttgcatgagg 


1500 




ggcgggctgc 


cgacccagca 


gtcttcctcg 


gacagtccgt 


cctgattctc 


tctggttggc 


1560 




cgtggaggga 


ccacatggct 


ccaaggcctc 


tcagctccgg 


gcccacacac 


cccgggctgc 


1620 




cgcacaaact 


ccagccctag 


tctagatcca 


caaccccttc 


tcgaagatca 


accgcgacct 


1680 




gggagcccca 


cttcttacca 


tagcgaggcc 


ggcgatgccg 


cagccacatc 


acccttccgg 


1740 




ggctcaggcg 


gaagaggctg 


catgtcccgt 


ctgcccttct 


cgccctctcc 


agccgtccgg 


1800 




ttgggcttgt 


cacggcaccg 


cctaccaaga 


cgggcggtta 


agacactagg 


ataggctcct 


1860 




ctccaccgga 


aaaggcggga 


tttagatcac 


gtcccgcagg 


ccggcggaag 


tagctgatac 


1920 




tctcattggt 


tgcaaaacct 


tgatctgtga 


aagcgggcgt 


tttggaagat 


accggaagta 


1980 




gagtcacgga 


gaggtaggat 


ccggaagtgg 


ggctgcctct 


ttaaataaca 


aaaatctgag 


2040 




gttctgttct 


ttttatcttt 


ttgctttctt 


tttaaaaaag 


ttccctgcta 


cttaccccta 


2100 




gaactccaca 


atgcgagaat 


ccccctcaat 


ttgtgagctc 


ccgcgacttc 


ctcttgtggg 


2160 




cttttgggga 


tgctagggtt 


ctcggcatta 


tcctcagggt 


gcgacctgtt 


cacccccttt 


2220 




tcagtttctc 


cgtttgcatc 


tgagggattc 


ttgggaatgc 


gaagcacttt 


tgaaatgctc 


2280 




tgtgttggtt 


gtgggattgg 


gaggacggtt 


gaatccagag 


ggtagtgttg 


agtaggctgt 


2340 




ttgagcattt 


ccccagcact 


ggcctgtcct 


ttcaatcccc 


agatattggt 


aaactgtggg 


2400 




ttccaaccag 


gcatcgaggc 


tgaaacgtac 


taggcaattt 


gaggtcagga 


aagaactttc 


2460 




tgtggtaacc 


aatgggaagg 


aactgccgtt 


tgcggactgc 


agcgattgat 


taggtacttt 


2520 



1386-WO 


aaagagatca actggcaaga 2540 

<210> 71 
<211> 2610 
<212> DNA

<213> Homo Sapiens 

<400> 71 

ctacaggctc gtgtcaccac actgggcaat acaaaaaata caaaaaaaaa attttgtatt 60

ttttgtagag acgaggtctt gccatattgc ccaggctgga attcttacct ttgttactgt 120 

atttaacgta tctttttcct ccggccatct tcatggtttt ctctctgatt tccacagttt 180 

gaatacactg catgtgtcag gcaggggctc atatttatca agttttgtgt gtgctctgag 240 

ctcaggtctt tcattatttt gggaaaatta ttggtaattt tctcttcaaa cattttttat 300 

gatttgttct ttcttcttct tttgggagtc ctattacatg catatgatat catttgatat 360

tttcccacag ttcttggatg ctttttttaa aaaaaaactt tttttcttct ttattttcca 420 

acgtgggtaa ttcctatttt tctcagctgt gttgatccta ctgctgcccc atcagaaaaa 480 

ttacctgtta tcagcgttct tcctttctta taatttgatg agtttcctcc tcatgcatat 540 

tgttcacctt tcgtacaaga gacctccaca tattaatcac agttaattta aatttccagc 600 

ctgtttcaat ttctcgatca cctctgagtc tagtcctgtt aattgcttag tgttattttt 660

tgtttttgaa acagggtctt gctctgttgc ccaggctgga gtgcagcggc gcgatctcag 720 

gctgttccct gagttcacac catccccctc aaccagcaga ttgcaaagtg tccgagtcgg 780 

gccgtgcagg agtctttgtg ggggtttcat ggactccgaa ttctcatttc tgctccatcc 840 

ccatctcatg aatccaaggc cccactctgt gcctcggctc ttcgtttgtg gtgctgaacg 900 

tcatctacgt catctacgcc atctacgtaa tcaacacaat aaagacgcct gccgggaacg 960

cggcccttcg gctgaatccc ttcggtggtt ccaaggccac tgccagagga tgcggacggg 1020

tctccagggc ctctacttac ccaggacttt gaggcacatt agcttcgcct aggcactcgc 1080 

ttttacgaat tcttatgttt ggttttgttt tgagacagag tctcgctctg ccgcccaggc 1140 

tggttaaaag atagggtctc agccgggtgc ggtggctcac gcctgtaatc ccagcacttt 1200 

gggaggccga ggcgggcgga tcacctgagg tccggagttc gagactagcc tgggccaaca 1260

tggcgaaacg ctgtctctac taaaaataac aaaaatcatc caggcgtggt ggcgcgcacc 1320 

tgcaatccca gctactcggg aggctgaggc aggagaatca cctgaaccca ggaggcagac 1380 

gttgcagtga gccgagatcg cgccactgca ctccagcctg ggcgacagag ggagactccg 1440 

tctcaaaaaa aggaaaaaaa aaaaaaagaa aagaaacaaa agtgatgggg tctcgctctg 1500 

ttgcccaggc tagtctggaa ttcctgggct caagcgaccc tccagcctcg gcctcccaaa 1560

gcgctgggaa tacaggcgcg gctaccgcgc ggtctccggc tgccgaaaca ccgccctgcg 1620 



cgcggaccgt 


tcggccgccg 


ggaggaacag 


cggctgcccg 


gagctcagag 


gcgcgcgcgg 


1680 


ctttgcgctc 


cccgcggcgc 


tctgagcctg 


cctcggcttg 


gttggccagg 


tggtctcttc 


1740 


aggaccaacc 


ccagtcattc 


ccggcaggaa 


ccacgcttga 


ggggcggcag 


tctgcccgcg 


1800 


cgagacgccc 


ccgcggacta 


caccgcggcg 


gcaaagccaa 


acgcaaaaac 


tacctcaccg 


1860 


cgcgcaggcg 


cctcccccag 


gaccaacatg 


gccacgacgc 


aaggcctcga 


cctgaggggc 


1920 


gtggcctggc 


cgccgccagc 


caacgggtgt 


gcgcgcctgg 


ccgcagccaa 


taggaaggca 


1980 


gcgcgggctc 


gggcgcaggg 


agccgccgcc 


ggggctgtag 


gcgccaaggc 


catgtccgac 


2040 


tcgtgggtcc 


cgaactccgc 


ctcgggccag 


gacccagggg 


gccgccggag 


ggcctgggcc 


2100 


gagctgctgg 


gtaggtgggc 


gcggcaggcc 


gcgggagtgg 


gcggcgtccg 


gcccgggacg 


2160 


gtttcgccgg 


ttccccgatc 


ccttcccgcc 


agagcctccg 


ccggtcggat 


ccccggacgc 


2220 


cgcgcccggg 


gggctgtgcg 


gggtgggcgc 


ccggctgggg 


cggcgcggct 


gcctcggacc 


2280 


cggcccctcc 


tgcgcctggg 


cggacgccca 


ccagaccgcc 


gcccgcaggg 


cgctcccttc 


2340 


tttcccgaac 


gccgcccccg 


ccggccgccc 


tgtcaggcgg 


gcctggggtg 


cgcggcctgg 


2400 


ggctcccctc 


agcgcagagg 


ccgcccctcg 


ccagccgtcc 


ccgggctccc 


ctgcctcggg 


2460 


ccctcctggg 


ccgtcttccc 


cggcgtccgc 


ggtggggccg 


tctccgttag 


tttcccgaga 


2520 


cctgcgccct 


ggggaggagc 


cccggcccct 


cttcgggagg 


gtgtcgctgg 


tgggtttctc 


2580 


cgcggcgtcc 


acctgcgcgt 


cgggccgggg 








2610 



<210> 72
<211> 3076
<212> DNA
<213> Homo Sapiens

<400> 72

gctgggatta caggcataac atggcccggc cctggccatg tttttaactg tgtttctcta 60

atagctaata atgccgagca tctttttatg tgtttcttag ccattagtag atcttttttg 120

gtaaaatgtc tttttttttt tttttggtcc atcttaaaat tgttttttgt tttgttttga 180

gacagggtct cactttgttg cccacgctgg agtgcagtgg ctcaatcatg gctcactgca 240

gcttcgacat ccctgagctc aggtgatcct cccacctaag tttcccgagt agatgggact 300

acaggtgtgt gccaccatgc ccagctaatt tttgtatttt ttttgtagag gtggggtttt 360

gctatgttgc ccaggcaggt cttaaacttc tgaggctcaa atgatcctcc cacctcagcc 420

tcccaaagtg ctgggataac aggcatgaac caccacaccc agctaagatt gtttttaaaa 480

atctttttct tgagttttgg gagtttttat gtgttaggga taccagtccc ttatgaggta 540

tataattagc aagtagtttc tcccactctg tgactgtgac ctttcttttt ttgaggcagg 600

gtctcactct gttactcagg ctggagggca gtggtgtgat catggctcac tgcaacctgg 660
aactcctagg


ctcaagggct 


cctcccacct 


cagcctccca 


agtagctggg 


tctacaggtg 


720 




tgttattgtg 


ccagggttaa 


tgttttaaat 


tttttgtaga 


gataatgtct 


ctacaaaaga 


780 




caccatcttt 


gttgcctagg 


ctggtcttga 


actcctggct 


tcagggaatc 


ctccagcctc 


840 




agcctcccaa 


agtgctggga 


ttacagcatg 


agccacatcc 


agcctatgat 


ttttcttctt 


900 




ttcttttctt 


ttcttttttt 


ttttttttga 


gatggagtct 


cgctgttgcg 


caggctggag 


960 




tgcagtgggg 


cgatctcggc 


tcactgcagg 


ctctggcccg 


cggggttcac 


gcctttctcc 


1020 




tgcctcagcc 


tcccgagtag 


ctgggactac 


aggcgcccgc 


cacatcgccc 


ggctaatttt 


1080 




ttgtattttt 


agtagagacg 


gggtttcacc 


gtgttagcca 


tgatggtctc 


gatctcctga 


1140 




cctcgtgatc 


cgcccgcctc 


ggtctcccaa 


agtgctggga 


tcgcaggcgt 


gagccacggc 


1200 




gcccggcccc 


agcgtatgac 


ttcttaatga 


tgtctttgta 


gtacaagagt 


ttttaatttt 


1260 




aataaagtta 


actttttttt 


aaattgtaca 


agcttttagt 


gctgtgtcta 


acaacttgtt 


1320 




gccaaaccca 


aggtcataaa 


gctgttctct 


tacgttttct 


tttttttttt 


tttttgagac 


1380 




ggagtctcac 


tctgtcaccc 


aggctggagt 


gcaatggcac 


gatgtcggct 


cactgcaacc 


1440 




tccgccaccc 


gggttcaagc 


gattcttccg 


cctcagcctc 


cggggtagct 


gggattacag 


1500 




gcgcacgaca 


ccacgccctg 


ctaatttttg 


tatttttgta 


gagaaggttt 


caccatgtta 


1560 




gttaggctgc 


tttacgtttt 


cttttagaag 


ttttatattt 


ttggctctta 


tatttagttt 


1620 




gtgatccatt 


gagttgattt 


tatgtacgta 


tgtatggtcg 


cgttcttttc 


tttcctgtct 


1680 




tttttttttt 


tttttttttg 


catatggata 


ttcaattctc 


ctagctccat 


ttaatttgaa 


1740 




atgattgggc 


aggtactttt 


gagcagtgca 


agtacagagc 


ggcactgcca 


gcagactaca 


1800 




cgcggtagaa 


agccgacctt 


ggtgagcgtg 


ttggtgctcg 


acagtgagca 


gagaaaggat 


1860 




ggacgattac 


ggagcgccct 


cgtctccagt 


taccgctttc 


tggaaacacc 


atccgccggg 


1920 




gcggagctgt 


tccgccccgg 


tgcggtacta 


cgactcccag 


catgcacctc 


gcagtcggcc 


1980 




ctcggtggaa 


gcgggaaccc 


aggaggacct 


gggggtgtgg 


cagcgaggaa 


gggccgagcc 


2040 




acggactgtg 


gggccgaaac 


tcgctcccgc 


ccaccctttc 


tcgaggctgt 


ggcctccgcg 


2100 




agagccgagc 


gggccgcacc 


gccggccgtg 


cgactgcccc 


agtcagacac 


gaccccggct 


2160 




tctagcccgc 


ctaagcctgt 


ttggggttgc 


tgactcgttt 


cctccccgag 


tttcccgcgg 


2220 




gaactaactc 


ttcaagagga 


ccaaccgcag 


cccagagctt 


cgcagacccg 


gccaaccaga 


2280 




ggcgaggttg 


agagcccggc 


gggccgcggg 


gagagagcgt 


cccatctgtc 


ctggaaagcc 


2340 




tgggcgggtg 


gattgggacc 


ccgagagaag 


caggggagct 


cggcggggtg 


cagaagtgcc 


2400 




caggcccctc 


cccgctgggg 


ttgggagctt 


gggcaggcca 


gcttcaccct 


tcctaagtcc 


2460 




gcttctggtc 


tccgggccca 


gcctcggcca 


ccatgtcccg 


ccagaccacc 


tctgtgggct 


2520 




ccagctgcct 


ggacctgtgg 


agggaaaaga 


atgaccggct 


cgttcgacag 


gccaaggtaa 


2580 




cacggttgct 


ggcaccctcg 


gtttgcagcc 


tcaagatccc 


tgaaagcggg 


tttgcagtgg 


2640 




atttacccca 


acagatgggg 


agggactgag 


cttgaccaaa 


gagccagaaa 


tgactggaga 


2700 




atgcatccct 


tgccactgct 


gcaaggggag 


aaaaaaggat 


tgatcctcag 


tgacaacccc 


2760 




tccctcatgt 


ggcaggtggc 


tcagaactcc 


ggtctgactc 


tgaggcgaca 


gcagttggct 


2820 



caggatgcac 


tggaagggct 


cagagggctc 


ctccatagtc 


tgcaaggtag 


gcgggtcctc 


2880 


cccaggatgg 


tcagttcccc 


tcttccatag 


ccagagaaac 


atccgctcct 


gcgtttttgg 


2940 


gatcgatata 


attactcggg 


gcagggagtc 


ctgtttaagg 


cacagaggag 


actggagtgg 


3000 


aatcatcttt 


gtacaggcaa 


atccctctct 


tccttacaca 


ctcacagagt 


ggcatttgaa 


3060 


aaatggtttc 


caagat 










3076 


<210> 73
<211> 2567
<212> DNA
<213> Homo Sapiens

<400> 73














cacaccatct 


cttgctccgt 


gagtatcttt 


gtctctctag 


ctcctcttct 


tctctcagta 


60 


catgtccctc 


cttgactccc 


gcctctctgc 


aaggtgtatt 


tggctgcctc 


agttggcctc 


120 


tccccctctg 


catctctggg 


tggggtgttc 


tctgcccgtc 


tcccacccac 


acccaccccc 


180 


ggtgctcccc 


ttccccccag 


caggacagcg 


gctcaggttc 


acgcacccca 


cggcgggccg 


240 


gctgggcgca 


cgcacgtcct 


tgcacacaag 


ccgcacgtag 


ctgtacttga 


gcacgtcgat 


300 


gagcgtgtag 


agcgggggcg 


cactggccca 


gcggcagcgc 


gccaggtgca 


tggagctctt 


360 


gacgaagaag 


agcgccagcc 


gctgctggca 


ccacgcgtcg 


aagaagcggc 


tgaactcggc 


420 


ccacgagaag 


aaggcccgct 


cccgcagctc 


ctgctcctcc 


tgccccgcag 


ccgtgccggg 


480 


tgggggctcc 


ggccgctcca 


tcctgggggc 


ctgcgtggag 


gaggggagaa 


caggtggata 


540 


tcagacccat 


tcccacccgg ggtatctcat 


ctactccatt 


cttggcctgc 


cccgtcggtt 


600 


gctggtgcct 


ctatcgaggt 


gggtagcccg gggtcggacg 


tgcctgtttt 


tctccaaata 


660 


tataaatatc 


aacctccatc 


ctatctttgg 


cctcctccca 


ccgccttatc 


cctggttcac 


720 


ttggagcctg 


tcatcttgat 


tcctaattcc 


aactcgtctc 


ctcctccgca gatgtgaccc 


780 


ttaggtacag 


ttggaatctc 


tcctcccaaa atacgaccct 


taagctcaga 


tgttccttaa 


840 


ggacatctcc 


tcaaatgtgt 


tctcaaattc 


cagctaaaac 


ctcctcccct 


tccagctgtg 


900 


tctctcaccc 


aagagtaact 


tctaactctc 


gtattcatct 


ggaactcctc 


cttccatgtg 


960 


ccaacagttg 


gctgtaaccc 


ctccaaagac 


gctccatctc 


cagatgtgct 


cccacatcca 


1020 


ggccacggac 


ccctcacccg 


gtcacatgct 


tcatgcacct 


gtggctccgc 


actccccaga 


1080 


tgtgcctctg 


gcgtgcagct 


gttgcccctt 


cccccgatta 


tgaccctatg gctcgccaca 


1140 


tgcagctgta 


gctggggctt 


ccctgagaca 


ctctcatctc 


cagatgtact 


ccccacatgc 


1200 


agttatccac 


gcttcgccta 


caggtgtgtg 


ccccacttgt 


ggctagttct 


cctcggaagt 


1260 


gtcaccagta 


ttcacctgtg 


gtcccctcct 


cctcagatgc 


ggcccccagt 


ccagctgtgg 


1320 


gcccctcctc 


ccagttacat 


ccaccatccc 


ccgcaatatg 


catcttcgtt 


ctagacatgg 


1380 



cccctcgtcc tcggatgggc tccttcaccc cagatgctcc ccccacgtcc agctgcgcgt 1440

ctcccctcga gcagccccat ccagcccgct cccgacgctc ctactccccc cctccccgcc 1500

cgctgcggca ccttccagcc ccgccgtccc acctagctgt gcctctcccc tccccaagat 1560

gtgcaccctt cccgcccctc cccactcacc tacccgcccc ggagcggcgt ccacctccca 1620

caatgccccg cgcccaggcc tggcccggcc cttgctcccg ggatgccccg cgcggtctcc 1680

cgcctctctt cccgccgtgc ctcgcggggg cgcttccacc gattcctcct ctttccctgc 1740

cagtcactcc tcagaccctc agccacaccc gctcatccag ggcgagggaa agcgcgggca 1800

ttttcccagt gtgctctgcg ggagggctcg ccccacttca ccccttttcc cgccctcctc 1860

ccattcggga gactacgact cccagtgtcc tccgcgcgac ggcggcggtg cggacggtgc 1920

ccaggtcccg cccctaggct ctgccccgcc cccgcccgca gacgtctgcg cgcgaatgcc 1980

gtggcgcgaa cttgggactg cagaggcgcg cctggcggat ctgagtgtgt tgcccgggca 2040

gcggcgcgcg ggaccaacgc aaggcaagtg gggccgtccg caagcagatg ggaggcggag 2100

ggcggcgggt gcgccgaatg cttggggcct atgcttcgcc atgtcggggt gtctgcagag 2160

gagtgggcgt ggggacgctg aggctgccga gagcgcggtg gagacggaag agcgcgggct 2220

gcgggccgcc ggagagtgca gagaggtgtc tcccagaggg aggggggcca ggtagagggt 2280

agacgagaga cagagacagt tggacaggtc ctctgagaag aggccttgag gtgcgagttc 2340

acctggaagg gggagaggcc aaatggaact gaggggcggg gcgggggggg ggaaaactgt 2400

gtgggcgggg ccagctggaa atcggaaggc cccccgaggg ggcggggcta tctgggaggg 2460

ggaggggctg aagggagcta aggggcgggg ccggggaaaa gattgcgtgt gggcggggcc 2520

acctggaagg gggaggtgcc aagggtgggg ctggctggga accggaa 2567

<210> 74
<211> 2278
<212> DNA
<213> Homo Sapiens

<400> 74

tcacagaagt caaagctcag gaaaagcccc tcgagggttt ttgtgcggca gaggtgggtt 60

gtggggtggg attgtgcctg ccacagtgga ggggccctgc agacccagat aaaccttcaa 120

gtggccagaa gcgggggatg gctctgctgg gtgctggggc tgccatgggc cgtgggagcc 180

agcagtgtgc ccagctccct cagggcccgt cccctaggcc cttccgtcca ctgggccaag 240

caccgtccct gcccctccct aggggcatgg atctgacttg agaggttgtg agagcttaca 300

ggcgctgggc cgtcggggag gcctcagaag cgtaggacgg ctgcgcactg ccgggccgtg 360

ttcagccctg gtctggcctc ggcctctaga ggaggctgcc tgcgctccag caggcccaac 420

ccagaacgtg ggcgagctcc cttcagcatc cctgggcgga aagagggatg ggggctctgc 480
tgcagaggca gaatccgcgc cgctccctcc ttccttcccc cgaccagcct gtgacaaccc 540

cggccagggg cgggggcctc cgcacaagcc tggcgtccac ttcctggata aggactcccc 600

ggcccactcc ggaccagggc tggggcggcc tcccaggcgc tcactccgct ggcaccccac 660

cggaaaacac gtctgcggcc cgccccctcc cccaaagcac gaccactccg cccgggcccc 720

tcgaggatcc actcaggttc acgacgggcc cgtcctctcg gtggtctgac caccggctgg 780

tggagtgggc tctggggccg ccaggcgacc agggcgcagg cgggggcgga cagctcattg 840

ggaggggcgc cggggcacag tgcggggctc gccccacccc caggtgcccc ttccccgctc 900

tcgcctcgca ggcaccgcat cgggcccggg aatcggtccg gacctggcgg tgggcgctgg 960

gaagaggatc cacctccacg tggcccgccc cgccccgggg gcgcagccag ttcccggcgc 1020

tcactgcccc ccttctcccg gcttccgtcc ccttctgcgc aggcgccgct ccgccccggt 1080

cctaggggtg cttccgtggt cggcggctgc tgggctccgc gccggggtcc gagtcccacg 1140

aagccccggc ccgagccgcc ggatgcccgc gcgcagcggg gcccaggtga gcgcgcgcct 1200

cggccgcccc gcggaacaga cgcgcccacc cccaggcgca gcagcgagcg cggccgcggg 1260

agcgggagtg ccggggacgg gcgtagcgcc caccgccccg agggttcggg gcagagccag 1320

agcataggcc aagggccaag ctcgggccga gagcagtggc cgcagcgccc gggggctgaa 1380

cccacggcgc gctggcagcg cgggccgagc tgcggagacg gtcacgtcag cgtccgttcc 1440

aggccgactg gcagtctccg ttctacatta acgtcagcac tcccgttaaa aataatgcat 1500

ctctcccatg ccaggaggac ttaggtgctg ctaaagacca gccctccggg tgctgccagg 1560

ccggcgctca cccgccacct tcatcttccc ttctcctttg ccccaggaca gccgaggatg 1620

tgtggttagg ttccccctac ccatggggag gccagaggtg ggaggctggc ggcctgctcg 1680

gtctcagcag accctcctag tccctcagga gaccttgcct ttgccccact tgctcgttat 1740

ccagcctggg ccatgaagca gaggacagtt agggaccctg agcacgcggt ggtcaccccg 1800

gtgctcaccc ctccctgtgt gtccgacctt ggccctgcta agatcctgtg ttttgaattc 1860

tggcaagggt tggatgaaag ggcagggctc cagaaaccag ctcagacgtt tgcttgggac 1920

ctgcatgatg agtgggaatc ggagggcacc agccctgctg tcccaggctc aggcccccat 1980

ctgctcccca ggtcatgcag cctgggcccc catgccgtgc agctcgcaca tatgtggggc 2040

agagcagcca ccctgccccc agcagcagcc gtccatcgtc agacgtgatc atttcctgag 2100

gcctcgagtg tgtcagggtg tttgtgcctc ataacaaccc acaggatggt cacccccgct 2160

ttgcagatga agaaaccaaa gcaggtggtc agatccagtc cttgcacttc ctgagcctga 2220

ccttaccaca cagctgtctc c tat tccrcrat gcttatttat tttttttccc attacagt 2278

<210> 75
<211> 2401
<212> DNA
<213> Homo Sapiens

<400> 75





tcatgcctgt 


aatcctaaca 


ctttgggaag 


ccaaggtggg 


aggactgctt 


gaggccagga 


60 




gttcaatact 


agcctgggca 


acacagcaag 


atctcatctc 


taccaagaaa 


aacaaaggat 


120 




agaggagtca 


actgaaaaag 


atcccagtga 


ctaaagctcg 


aacaatttta 


gcaataaaat 


180 




aaatacgcat 


gatataaata 


catggctgaa 


taaataaact 


ggggagaata 


gaaaaatatc 


240 




ctgtgcagaa 


gaattccaag 


taacttatat 


agatatttta 


cctttacctt 


caaggaagta 


300 




gaacataact 


tttcattcct 


tcccaggatg ggctaggcat 


gatgacttcc 


ttccaaagag 


360 




tacagaacgg 


aaacagggca gggggattaa 


cagtggagaa 


acctgaccaa 


cgctactgca 


420 




gctaggtgat 


caaggccaaa 


acatcgacag 


tgataaagca 


tgctgagagc 


acctttgatt 


480 




tgatgtagtg 


aaaatcgtgc 


tttacctctg 


taatcttcct 


gccaaaaacc 


cataatccca 


540 




gccccaatta 


tgagagaaac 


attaggcaaa 


tatcaattga 


gaaatattct 


acaaaatacc 


600 




tgactggtac 


tcctgaaaac 


tgtcaaggtc 


accaaaaaca 


ataaaagctc 


aagaaactgt 


660 




cacagcccag 


aggaacctaa 


gatgtgacta 


ctaaatggca 


tgtagtaccc 


taaatgggat 


720 


15 


cctggaacac 


aaaaagagta 


tcaggtaaaa 


actaagagaa 


tcagaataaa 


gaaaggactt 


780 




ttgttaataa 


tagtgtatca 


atattggttc 


atcaattttg 


acaagtgtac 


catactaata 


840 




atgcaaggtg 


ttaataagaa 


acattcagca 


tgagattttt 


aggaattttc 


tatattatct 


900 




tcacaatttc 


ctgttaatct 


aaatctctcc 


taatgacaag 


tttatttaaa 


aagtaaaaca 


960 




aaacttgaag 


gagggaggaa 


acaagaaggg 


aggaaacatt 


ggagacagaa 


ccagcttggc 


1020 


20 


aagttgacag 


ataaggtctg 


agaagtaggc 


aggggaaaga 


tcattcattt 


caggcaatat 


1080 




ttttccattt 


tacctgtata 


agaaccatat 


gagccctatt 


tttctttctt 


tcttttttct 


1140 




ttctttcttt 


tctttttttt 


ttttttttgt 


agagatgaag 


atttcactat 


gttgaacagg 


1200 




ctggtctcaa 


actcctggcc 


tcaagcaatc 


ctcccacctc 


agcctcccaa 


agcatgagcc 


1260 




accatggtgg 


gcctgtatga 


aggaactttt 


taaaaaatgc 


tacaagccgg 


gtgcagtggc 


1320 


25 


tcattacctg 


taatcccagc 


attctgggag gccaaggtaa 


gaggatcact 


tgggcccaga 


1380 




agttcaagac 


catcctgaac 


aacatagcaa gaccctgttc 


tctgcttaaa 


aaaaacaaaa 


1440 




acaagctggg 


cgtggtggat 


cacgcctgta atcccagcac 


tttgggaggc 


tgaggtgggc 


1500 




agatcatgag 


gtcaggagtt 


cgagaccaga 


ctgaccaaca 


tggtgaaacc 


ccatctctac 


1560 




taaaaataca 


aaaattagct 


gggcacggtg gtgtgcgcct 


gtgatcccag 


ctactcagga 


1620 


30 


ggctgaggca 


ggagaatcgc 


ttgaacccgg gagacggagg 


ttgcagtgag 


ctgagaaagc 


1680 




agtgagctga 


gatagcacca 


ctgtgctcta gcctgggaga 


cggagtgaga 


ctctgtttca 


1740 




aaaaaatcag 


cctgcccagt 


cagagcgcct 


cagcgccgtg 


ctcgggacat 


cccgccctgc 


1800 




ggccagcccc 


cgcgtgacgt 


caccgcattc 


cggctccgct 


cctcccgccg 


cggcgcccgc 


1860 




accgcagtga 


cagccagccg 


ggcccggtgc 


cggagaggaa 


gtgcggtccg 


cgccaagccc 


1920 


35 


gtccccgccg 


acgccggctc 


cccgcggctc 


gggtgacagc 


gtcgcggccg 


ccggacgcag 


1980 




cgcggggcag 


gcgcgggcag 


agccgagcgc 


agcggaggct 


ccggcggagg 


cgcggggaaa 


2040 



L 
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atggctgatg 


actttggctt 


cttctcgtcg 


tcggagagcg 


gtgccccgga 


ggcggcggag 


2100 


gaggacccgg 


cggccgcctt 


cctggcccag 


caggagagcg 


agattgcagg 


cat agagaac 


2160 


gacgagggct 


tcggggcacc 


tgccggcagc 


catgcggccc 


ccgcgcagcc 


gggccccacg 


2220 


agtgggggtg 


agtcagcgcg 


gggcctggag 


aggggctcag 


ggcgcgcacc 


cgggggaccc 


2280 


cggccggggc 


ccaggggcac 


agggaagaga gcctgctcta 


ggccacccgg 


ggcaggagct 


2340 


gggagacgtg 


gggaagaatc 


ttcttggaga 


tctccatgta 


ggacttccga 


gctggggatg 


2400 


a 












2401 



<210> 76
<211> 2501
<212> DNA
<213> Homo Sapiens
<400> 76

ccagcctggg ccgcagagtg agaccctgtc tcaaaaaaag aacctactag tctacatacc 60
acacttcctc atccccatct gagactatat atattttttc taacatgagg caatgccaaa 120
aagaggggct ggtgagtgaa agtaagaaca gaaagacatg gaggcaagtc ttatagaata 180
atagccaaca cttaaactta cacttaacag cgtgataggt attgttccaa acacattaaa 240
ttcatttaat ggtccttaca tgtctatgta tttggtgatt attatcctta ttattcacat 300
tgctgagtgt attattctgt tctcatgatg ctgatagaga catacccgag actggataac 360
ttattaaaaa aaaaaaggtt taatggactc acagttccac gtggatgggg agtcctcaca 420
atcatggtag aaagcaaaag acacgtctta catggcagca gggaagagag agaaatgaga 480
accaaacaaa aggggtttcc ccttataaaa ccatcagctc tcatgcgact tattcactac 540
catgagaaca gtatggggga aaccaccccc atgattcaat gatctaccag gtgcctccca 600
caacctgtgg gaattatggg agctacaatt ccagatgaga tttgggtggg gacacagcca 660
aaccacatca ctgaggaaac tgagttatag ggagattagt aacgcccaac acagctggta 720
ggtggtggag ccaggcagtc tgactctagg gtctggactc tgaactgcat catgctgcca 780
agaagttcct cattttttcc tctctctaag tttcccttat tcccctacag tcattccttc 840
aacagcattt ccttcaccat cttttctact tctactatat aattaatttt ttcttcttgg 900
tcccaaattc caacgtgcaa atgcagcctt atatacccta attcatcttt acctttagac 960
tttcttccaa tgtttctact tcattccatt ttaaatttat ccatgagatg cctatttaca 1020
agctgtaacc atcatgaagt gaatgaagaa taatacctac tactgtacaa tagaattcca 1080
agagtataaa taggagttat ggctttctga cttgaaacta aatacttgat acttgatttt 1140
gctgtctgag atcaatctga aaagtaataa taatcactaa catttgttga gcatcaattg 1200
tgggccaagt gtcatttcaa tcactctgta catattaact catttcatcc tacaacaacc 1260
cggtgaggca agttctgtta ttctgtttta cagttgagga aacagaggca tagagagctt 1320
aagtagtttg cccagtagat agccagaaga ggagccagga tgggtctcgg gcagtttaac 1380
agcacagctg aagtcttaac cactatgcca acagcttttt ggtcctacac atcccatggg 1440
aagaggaaaa taaaaaggta tctatttgta taccttttta tttctgatat aagaagcaga 1500
attcctttca catgacctat gtctatttaa tacgtcattt tgaaacttac caataaaatt 1560
tcccaagcgc cagaaaactg ttagtggctt tttccatttc tctctatttt tttttgtgct 1620
actaattttg cttctttccc tcagaaggct gccggaatag taaacattca ctgacatgtc 1680
ataattactg gaaaatgggc actggaaaat cacattgtaa ttaattcaaa gcatgttttc 1740
caaatgtact actttaaatt ggagcttata tcataatcca aggaaacctt tgtgtgtgta 1800
ctgttcccac attgctcagc ctgggatatc caggagtaat tcaccttgcg cctgcctcca 1860
gaccatcttc catggaaggg ggtgacccct tgcctcttgg caaccactat ttctaagctg 1920
ccaacattac tcttgcatta tcaacattct aacttcatgg gaagggctgt ggtgagtttc 1980
tggaatgtga ataggaagtt gtttttctaa acagcctgac actgagggga ggcagtgaga 2040
ctgtaagcag tctgggttgg gcagaaggca gaaaaccagc agagtcacag aggagatggt 2100
gagtttattt ttttctgcat gggaagtggt tgaagtgagt tggagtggta tggagtaaag 2160
tcaggcaggt aaaggttcag aaagtgagga acagcgatag ccatggagtt ttatgttgaa 2220
ttgcctatta gattttgtga gtacttttaa acttgctgtc cactttgacc ctcccaacac 2280
ccttgtgagt tgaggttgct atttctattt tacaaataaa gccatcgtgg tttacagagg 2340
ctgtgtttta tctaagcttc actgttaggc tacatgatgt tgggatctgg ggcctgtcct 2400
ctggctccgc agctgctgtt cctcctacta gaatttatag gggctctctg agaatagatc 2460
atggtaaacc tgtcacccca ttttccaaga ctgtacttct c 2501



<210> 77
<211> 2501
<212> DNA
<213> Homo Sapiens
<400> 77

cctgggtcct ctcttccagc tcccaaaatg tactctattt ttatctgttt cacgaacgct 60
ggtccagata gtcttccatc ccccactgac tgttagaagt gactctcagc ttttgtccat 120
ctcgaagttt ctgtgctcag tgtgcctctc agactaaagg cttcctttgg gaagccccga 180
ctctcgcttc tcaggacaga gatccagggg ttgggggagg aaaaggttga ccagaagcca 240
tagcggagca gggagagaga gtgtgaaaga cagacccgcg gccaggctcc cagttctcca 300
gctcgtagag ggcccaagtg gccgctataa tctgaaagag cagatatcgt aatcccatag 360
tacttcctat tggctgcagg acacagttct gtcctgacac tgaaatttgg gtgtgtcagg 420
gttctgggaa ttcacaacgc tcacaacttg tgaagcagct gtggggtggg ggatggggag 480
ggtttcagca gaggaagtga ggtcagtcaa taattgatgc ctgtctgagc ttttagccat 540
tatctccccc agcctctatt cctgtcaaaa ggtggggcgg ggcaggagga ggggtccctg 600
gctcatcttg tagaatcccc atattagagt aagacacctt agaggtctac tcctgcttct 660
aatacccacg tctttccaag tgtctctgag gccaccccct ccccagcctt ttcatttatt 720
catttaatta acgaacgcct tcattgaggg cctcctctga gtcaggctca gccagccagc 780
atctttgcta tgagctgaga taagcatcat ttccgtctat tctcacaacc accctatgag 840
gctggcacgg tttactatgc ctatttagca gatgggggac tgaagcatgg agaggtgtca 900
ctagcctacg gtaacacaac cagcctgcat tcctagtagg tagtttgact tcagagtctc 960
tgtggataac caggaggcta ggactaagac cagagtcctg caggtactta gatggttgga 1020
gcaaagcagg gcagtgaggt cagtgctccc agcctgtgca ggagcatcag gaagagtctg 1080
tgtccccctc ccctgccggt atgaagccat tctgcttccc tccccagctg ccttgtgtca 1140
gcagagttcc agggaggctc cattccccac ctctatctaa agctccattt gctggggtgg 1200
gggccctgcc tggaagggga aggtccaagg ctgctcccag cgtgtccctc catcctgact 1260
gtccctggcg gggcgggggt gtctttgtca cccagctgca caacggccag gaagggctca 1320
aaccatcctc agggctaacc caaggccgtc ctctgggcct gtatacccct gtgctgagtg 1380
cggatcggga gaggctgctg aagacaggag gggacaaatg ggggacgaag gggcccgagg 1440
gaggggactg aaggatttgg gccaagtcgg gagttcccga gggcggagtc aaaacgcatc 1500
tggattttgc tagccccaaa ctctgccctc attgctgcaa gcctcctaga ccgaggaccc 1560
ccgggctgag ggtggggtaa ggataggtag tgtccctccc cgtcccaccc ccgcctgtcc 1620
cttcctcggt ggccccttcc cggcgccccg attccaggcg gcccctccgc tgctgccagc 1680
cgatccccct ctacccccac ccactactcc ggccgccaga cgttgcctac agtctcggct 1740
ctgtctccca cggctgtggg tccggacccc acgggacccc tatgggaccc ccacaggacc 1800
cccacggcct gagtccaagg cccgccccct cggggaggcg gatgtgggag gcccggcccg 1860
ggtgcgggcc agcgacccgg gagctgcggg cggctgggag gggaggccgc cctgaggggc 1920
tgggagcggc gcgggggtgg gtcccggtcc tgcagcccca gcgaggggcg agcggcggcc 1980
agtcggcgag ctgggcaata aggaaacggt ttattaggag ggagtggtgg agctgggcca 2040
ggcaggaaga cgctggaata agaaacattt ttgctccagc ccccatccca gtcccgggag 2100
gctgccgcgc cagctgcgcc gagcgagccc ctccccggct ccagcccggt ccggggccgc 2160
gcccggaccc cagcccgccg tccagcgctg gcggtgcaac tgcggccgcg cggtaaaaaa 2220
gaggtggccc cggtccgccg aaggctagcg ccccgccacc cgcagagcgg gcccagaggt 2280
gagtcgaggt ccgcggacgg gaccgggtgg cgggcggcct gacccccgct tcagtgggcc 2340
cttccttcgg gcggacccca gagtcaccgc agagtggtcg cgggaggctc agtcccagct 2400
cattagaaag gcaagctgct cctggctgac cacgcacagc tcccatgacc ctacctgaga 2460
cttggagggg aatggacgag actggactgg aaatcagaaa c 2501
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<210> 78 
<211> 2501
<212> DNA
<213> Homo Sapiens
<400> 78

tggctaattt tttgtatttt tagtagagac ggggtttctc catgttgagg ctagtctcga 60
actcctgacc tcaggtgatc tgcccgcctc agcctcccaa agtgctggga ttacaggcgt 120
gagccaccac gcctggccgc taactacatg tgttctatga ggtgaggtcc ttcccagacc 180
ctggaatcag gggttgcaat tagggtccaa ataatgaggt tggactacag ataacccatc 240
tcctttctta cctttgacta gatccaagga ctaaactcca agaacccgag catctgtccc 300
caaaactgaa aggattggac tagtcacccc ttgtttccct acagccacat cccaggcacc 360
tggcccttgc tttgtccaga aattcagcta taactccaca catctgatgg ccctttctgg 420
caagcaggca tttccatcag gaccctcagc tgccagacac atttactgga ggtcacttat 480
taaacctggg ctcaatttcc acacagggag gctactgaag catcacactg ggtctcccag 540
ccccttctca tagaggaaag atctctctgt cctgcagggt tggcagtcag cgccaagtaa 600
agggaattta gctcttggcc caagatccct gcccaggaaa ggtacttgcg cctgctggaa 660
actttgggct gaagtatact cctttccaaa aactcaggtc tgatatttac acaaagtctg 720
aaattaatgc agagaaaact tccaagtgct tggactggag cagaaggctg agaacaggaa 780
ggggctggtc cctggtacta gttttggttt tttggtggtt tttttttttc ttgttttttc 840
tcacagaaca gggcaaagct gagtgtccct ggatgagtga agcaggagga ttaatcatgc 900
ccagtgcttc tccactttaa actggttttc ctgggaattt gcaattgaga gtggggaggg 960
gtaagaatcg tgggaaaagg ctgatggtgt tcagccaaat tcatccttca cgtgcccacc 1020
cttctacagg cacatgcttt ggggccatcc acggctgcag ccaccccatc cttaggaagc 1080
accactggcc ttcctttccg gtacctggac tcagcatcac tcccagcctc ttggagatgc 1140
agccttcatt cagcacacag ctcagctctg agttctgttt ttgtccctag atgtctctgg 1200
ggtcacctac tactccctgc ttggtggccc aggcccatcc ttctccactc ttgcacctct 1260
tttagcagaa aaggagtgag aatggatatt tccatgggcc gtgtgtgcac tcccggctac 1320
ccctgacagc tctactcaga gctaccctcc ctcctggggc ttcttatgtg ttctaaggct 1380
gaggcaggaa gactgtgaga tcaggtgaca ctcaacagtt atgatcggtc ttaagattaa 1440
cagtcctggc cgggcgcagt ggctcacgcc tgtaatccca acactttggg aggccgaggc 1500
aggcagacca cgagatcagg agatcaagac catcctggct aacacagtga aaccccgtct 1560
ctactaaaaa tacaaaaaat tagccaggcg tggtggcggg cacctgtagt cccagctact 1620
caggaggctg aggcaggaga atggcgtgaa cccaggaggc ggagcttgca gtaagccaag 1680




attgcgccac tgcactcccg ggtgacagag cgagactccg tctcaaaaaa aaaaacaaca 1740
acaacaacaa aaagattaac actccttcta cttccaaacc taatacaaag ggacattgcc 1800
tagtgattaa gagaattcat tcattcaaca aatacttgtt gagcacctac tatgtgccaa 1860
gcactgttct aggcaccgga aatacagcag tgagaaaaac caaaaaaact ccctgccctc 1920
atggggtgta tattcaagta gctgaaacag acagtgaaca aacaaaaaag gacaataatt 1980
tcaaataata atgatgctat cggccaggtg tggtggctca tgcctataat cccagcattt 2040
tgggaagcca agtcaagcgg attacctgag gtcaggagtt caagaacagc ctggccagca 2100
tggtgaaacc ccatctctac taaaaataca aaaattagcc agacatggtg gcacacacct 2160
gtaatcccag ctacttggga ggctgacgca ggagaattgc ttgagcccgg gaggtggagg 2220
ttgcagtgag ccaagatctg acaggccttc agcaccactg cactctagac tggctgacag 2280
agcgagactc tgtcaaaaaa aaaaaagcta taaatagact ttaacagggt aacatgatag 2340
ggagggaggg ataggggagc agggtggtca aggaagggac atttaaacag gctagaatga 2400
caatggccag cgagggaaag atccagaagt gtgtgctgga agaagaaaga gcaagcacaa 2460
aacccttagg acaaaatcag ctcgtgtggt caaggcacag c 2501

<210> 79
<211> 2501
<212> DNA
<213> Homo Sapiens
<400> 79

tgtttctgac ccctggctgc agcctaatgg gccgactgct ggacagcggt cctgagtcct 60
gtttgaattg gtgctgcccc gacatcctct gacctcagct aatgatcctg cctgccgagg 120
gcagacaggt ctctgcaacc ctatgggtgg taggggtggt gatgagagga gaggtagtct 180
cacttgcaca gattttggtg tatggttctg tcttttgcac tctttcaaca gaggtctgtc 240
cagtccctct tgcaagtgtg gggagggggt ggtgcaggac tatgaggtaa ctgtgagaag 300
aggggctcca gcagaaccag ggtccaatgg ccttgaagag atggctgggg acagctggac 360
tcattacgtc tactcctaaa tggaggaaac gacccctcag ctacacagca cctgagccag 420
aatgtcacca tggtgctgct ccacaggatg acagctacct ggtttgtgag ggcccctatt 480
ctagggacag ctacttcatt ctgccctccc agagcagcaa gcaacaaccc tatgccagga 540
ggccaattgg cacgtcaagt gccagctcca atcgattgat agtagctgcc tggctctgaa 600
aggcagctgg gatcgattca ccatgctgcc agcacacaga tggacccagc ggtggtccca 660
gcagtgagtt cttgccttgg gccatttcat tttctttgtc ctggccaagg aatgattgga 720
tgaacacact ggactcccaa tatgggtgga taagacaaga gtgtctggtc acacccctcc 780
accactcata agcatggttg tgggcagttt ggttccccag gcggccttgg agaatgcaat 840
gagccgagga actggtcatc tccaggtgca tccagggcag gaaaggatga cagcatgcgt 900
gagccagggt cactggctaa gaagtcatct caggacctcc ccctagaaaa gcccactggg 960
cagcatccct gctggttccc ccctacacca caaggttacg cagagctggc ggagggtcat 1020
ggtcccactc atgtcaggtg ctcttaatct ggcaaggaaa tgtaacctac gtgaatctca 1080
acaggcagtg aagcaccgtt tcttcctgac tccaggtagg gtgaagaaaa tgggacagta 1140
gtacggggtg cgggcataaa cgcacaactc tgcctcccca gacgcagagc tgtggggctg 1200
tgagaatgcc aggaggaggt aagaaagggc ggccccatgg ggggcctgca gggtgggaca 1260
agcccaagag gtctctacat ccaggcctgg tgggggaggt gagcccctgg tttaccgagg 1320
gggtcccttc ctgccctcgg aaatactgca gctcctacct ccatcgtctc cccgctgcgg 1380
ggacccaggg gcgtgaggat gagagagccc ccaggcccca gggtcagacg actgtgttca 1440
agcaagtgag aacctctctg aggctgtttc ccaactgtaa aatggggata gcagcagaac 1500
tctctctcgc ggcttgcgtg aagaatacaa ttcgatgtcg acaggaggga gcggcgcgca 1560
gcgcgcagcg agtagcaggc gctgaagaag gatacctgtg aactgggagt ggtggcggag 1620
gctacgcggc cagagtccgg ggaaggggcg ccggctctgc cagtccctgc tcggggctgg 1680
atggtcgggg gatgttctcg taagtcggct gggagggagc ggtcccgcgt accctgccac 1740
cgccgccgca gaggttcggg caggtgcggg gccgcggccc ctccgcgagg gggccggtca 1800
tccgccggga ctgacatccc ggaggcccaa tggcaagccg tcatctccgc gcatccgccc 1860
aatcggcgcc ggttgccgtg ccgcgccggg tctctcgacc aatgggaaaa tttgctgtca 1920
gatggggcgg ggcggagatt cgcgtcgccg gcccggtccg ctttgcgcac gggccgcgtg 1980
agggcgggag ggctggcccg gggtctcggg ttgcgcgctg ggcctggagg gagggggcgg 2040
cccccgcacc ggtccgagtt gcggccgcgt ggactgcgac ccgcgccgcg ccgcaccgcg 2100
ccgcgccctg ggaacgccgc tccccgcgcg ccaacggacc cggggaagcc cttctggggt 2160
ccgaggccgc gctgcggggc cgcccacgct gcgctccagg taagcctgag ccagtgggcg 2220
gggtgtggga cccggggctg gggcctcggg tcggagccgg gactgggggc ggggctgcag 2280
atatgggacg cattccgggc agcggtccgg acagggtcct atccctggag tcgagatccg 2340
ggcgagggtc tgggccggac gtcggagcca atctccgccc cacccgcgtc ttgtccgcgc 2400
gctctgcggc gtccgagacc ccgggccggc gggggcgggt ctctttgtgc gtggccttgg 2460
ggccctaccc tacccgtccg ggcgtcttgc actgagcact c 2501

<210> 80
<211> 2501
<212> DNA
<213> Homo Sapiens
<400> 80

acagatgacc gaggggctcc cagcccggga ggtggaaatc cagcagggat ttccaaggcc 60
tagtttgcag ggctccagga tcgttcctag atcctggtct tgcagccttg acaaggggaa 120
ggagggaggc agcagaagga gggcagaaca atccatgcca ggctgtgatt tgccaagtga 180
ccatctggga agaatgggct ctcagaccag ggacagggag cagaggcaag cccgcatctg 240
ccctggttgc agaacccgga ttcagactca gggccccgat ttctgcctgg atcgctccac 300
tgggcggagg agtgactgtg gacacatcca gggttctctc caagtcggct tcctcatctg 360
ccaaatagag accgcagacc accagctccc aggcaggtgc tactcttccg gcccctccca 420
aggcaggagg gccaggcgta ctcgagacac aggtgtgctg ggggcccagg tgggccagcc 480
agcagcatcc tgcagggtaa tgggagcagg tgggcacccc gaggctggca gtaaacactg 540
gctatctgcc cccaggctcc caggaggggt cttgggcctc acctcctccg gccggaacag 600
gaaagcagct ccaggcagct gggtccacaa aaatctccgt tccctgaggt ctcagaggca 660
gtggcccagg agcatctggt caccttcggg aaaaaccggc ttggcaaagg ctcccccgag 720
ggcacgcgtt tcccggacag tgaggcagga cctaaactct tccgttaaca ctacattttt 780
cgcatttctg cagtgtttgc actctcaggc cccaccattt ccccgcatct cttagggaga 840
agttctcgac gtcccacctc ccctggaagg gtgctgctcc cagagacctt caggccaatg 900
gcccaatctc agtgccctca ggggagaggg gggtgcagaa aaacagcctg ggtcacaaaa 960
gaggtgcgag ggctgtgaga tcccggaggc accgacggga agcgagacgg agaacaggag 1020
ggcaggacgg gctggaggtg ggggatactg cagatggagg gagccacggt gggggagggc 1080
gtggacctga ccgtcctggc acaaggcggt cgggtgcaga cctccaggcc ctccgggtta 1140
aggtgccgcc cagagccctc aggccggggg cgcacggaaa ccacaggcag ggtgcgcgtg 1200
gagggacggg gaaagcgggg cgggttgggg aaggcgcccc gggaacctga acctcccacc 1260
ccgcctcagt ctcgaccact ccttaagccc caccccgccc caggtaaggc gcagtccacc 1320
cccattccca gtagattaac gcacaggtgg gggcgcgctc gggacatagc tgcgctaggg 1380
gacagcgcgc ccagcccagt cgcgggggcg aggagcaggg cggggcccag caggaaccca 1440
gctttgttag cgatgctccc cgtgagccac gcgccacgcg tacgcgcttc ctcaatgggg 1500
ccgggcgtgg agccgcgccc tgcgcgattg gccaaacggg tggcccacga ttggctgaga 1560
ccctggcccc cgcctcctcg gccccaggag ggtggggcgt gggtgtgggc tgcgcggcgc 1620
gtgctgcccc cggggatctt gcgcgcctcc cgaacagccg tgttgtcgcc agggccgcgc 1680
cttccctccc acagcgcgcg ctgcgcgtgc gaaggtctgg cggctcttgg gactggcggg 1740
gctgcgcgcg gggttagggt gggggtacgg gaaggctcaa cccaggacct gcgtaccttg 1800
ctttgggggc gcactaagca cctgccggga gcagggggcg caccgggaac tcgcagattt 1860
cgccagttgg gcgcactggg gatctgtgga ctgcgtccgg gggatgggct agggggacat 1920
gcgcacgctt tgggccttac agaatgtgat cgcgcgaggg ggagggcgaa gcgtggcggg 1980
agggcgaggc gaaggaagga gggcgtgaga aaggcgacgg cggcggcgcg gaggagggtt 2040
atctatacat ttaaaaacca gccgcctgcg ccgcgcctgc ggagacctgg gagagtccgg 2100
ccgcacgcgc gggacacgag cgtcccacgc tccctggcgc gtacggcctg ccaccactag 2160
gcctcctatc cccgggctcc agacgaccta ggacgcgtgc cctggggagt tgcctggcgg 2220



1386-WO 



cgccgtgcca gaagccccct tggggcgcca cagttttccc cgtcgcctcc ggttcctctg 2280
cctgcacctt cctgcggcgc gccgggacct ggagcgggcg ggtggatgca ggcgcgatgg 2340
acggcggcac actgcccagg tccgcgcccc ctgcgccccc cgtccctgtc ggctgcgctg 2400
cccggcggag acccgcgtcc ccggaactgt tgcgctgcag ccggcggcgg cgaccggcca 2460
ccgcagagac cggaggcggc gcagcggccg tagcgcggcg c 2501

<210> 81
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 81

aatcctccaa attctaaaaa ca 22

<210> 82
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 82

aggaaaggga gtgagaaaat 20

<210> 83 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> primer 
<400> 83 

ggataggagt tgggattaag at 

<210> 84 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 84 

aaatcttttt caacaccaaa at 

<210> 85 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 85 

aaccctttct tcaaattaca aa 

<210> 86 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 






<223> primer 
<400> 86 

tgattgggtt ttagggaaat a 

<210> 87 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 87 

ttgaaaataa gaaaggttga gg 

<210> 88 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 88 

cttctacccc aaatcccta 

<210> 89 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 






<223> primer 
<400> 89 

tgtttgggat tgggtagg 

<210> 90 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 90 

cataaccttt acctatctcc tca 

<210> 91 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 91 

ttttagattg aggttttagg gt 

<210> 92 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> primer
<400> 92

atccattcta cctccttttt ct 22

<210> 93
<211> 18
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 93

ggaggggaga gggttatg 18

<210> 94
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 94

tactatacac accccaaaac aa 22

<210> 95
<211> 19
<212> DNA
<213> Artificial Sequence
<220>
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<223> primer 
<400> 95 

ttttgggaat gggttgtat 

<210> 96 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 96 

ctacccttaa cctccatcct a 

<210> 97 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 97 

ttgttgggag tttttaagtt tt 

<210> 98 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 






<223> primer 
<400> 98 

caaattctcc ttccaaataa at 

<210> 99 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 99 

gtaatttgaa gaaagttgag gg 

<210> 100 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 100 

ccaacaacta aacaaaacct ct 

<210> 101 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 






<223> primer 
<400> 101 

ggagttgtat tgttgggaga 

<210> 102 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 102 

taaaacccca attttcacta a 

<210> 103 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 103 

tttgtattag gttggaagtg gt 

<210> 104 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 






<223> primer 
<400> 104 

cccaaataaa tcaacaacaa ca 

<210> 105 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 105 

gatttttgga gaggaagtta ag 

<210> 106 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 106 

aaaactaaaa accaaaccca ta 

<210> 107 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 






<223> primer 
<400> 107 

tggggttagt ttaggatagg 

<210> 108 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 108 

cttaaaaaca ctaaaacttc tcaaa 

<210> 109 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 109 

tttttgtatt ggggtaggtt t 

<210> 110 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> primer
<400> 110

cccaactatc tctctcctct ataa 24

<210> 111
<211> 25
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 111

attagaagtg aaagtaatgg aattt 25

<210> 112
<211> 19
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 112

tcaatttcca aaaaccaac 19

<210> 113
<211> 22
<212> DNA
<213> Artificial Sequence
<220>



<223> primer
<400> 113

gggatgggtt attagttgta aa 22

<210> 114
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 114

ccttcacaca aaactacaaa aa 22

<210> 115
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 115

taattgaagg ggttaatagt gg 22

<210> 116
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
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<223> primer 
<400> 116 

aaaaccaaaa ccaaaactaa aa 

<210> 117 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 117 

agtggatttg gagtttagat gt 

<210> 118 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 118 

aacaaaataa aaacttctcc ca 

<210> 119 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 






<223> primer 
<400> 119 

taggggaaaa gttagagttg ag 

<210> 120 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 120 

cccattaacc cacaaaaa 

<210> 121 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 121 

attttagttt gtgaaatggg at 

<210> 122 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> primer
<400> 122

tcttaaccaa taacccctca c 21

<210> 123
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 123

gtgggttttg ggtagttata ga 22

<210> 124
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 124

taacctcctc tccttaccaa 20

<210> 125
<211> 22
<212> DNA
<213> Artificial Sequence
<220>



<223> primer
<400> 125

taggatgggg agagtaatgt tt 22

<210> 126
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 126

acaacttatc caacttccat tc 22

<210> 127
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 127

tcccacaaaa actaaacaat ta 22

<210> 128
<211> 21
<212> DNA
<213> Artificial Sequence
<220>



<223> primer
<400> 128

aggttttaga tgaaggggtt t 21

<210> 129
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 129

tttggagggt ttagtagaag tta 23

<210> 130
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 130

cccaataatc acaaaataaa ca 22

<210> 131
<211> 22
<212> DNA
<213> Artificial Sequence
<220>



<223> primer
<400> 131

atacaacctc aaatcctatc ca 22

<210> 132
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 132

agggagaagg aagttatttg tt 22

<210> 133
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 133

ggaagatgag gaagttgatt ag 22

<210> 134
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
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<223> primer 
<400> 134 

cctacaaccc tatcctctaa aa 

<210> 135 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 135 

ttagtagggg tgtgagtgtt tt 

<210> 136 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 136 

caaacaaaac ttctatctca acc 

<210> 137 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 
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152 

<223> primer
<400> 137

ttatagggtt gagtttggga t 21

<210> 138
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 138

taaacaaaca acaaatcttc ca 22

<210> 139
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 139

tgaaaatgaa ggtatggagt tt 22

<210> 140
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
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153 

<223> primer
<400> 140

ttaaaaccat ataatccctc ca 22

<210> 141
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 141

tatgtttggt tttgttttga ga 22

<210> 142
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 142

aaccccatca cttttatttc tt 22

<210> 143
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
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<223> primer 
<400> 143 

gggtgtagaa gtgtttaggt tt 

<210> 144 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 144 

tttctcccct tacaacaata ac 

<210> 145 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 145 

tccccttcca actatatctc tc 

<210> 146 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> primer 

<400> 146

tgagagtgtt ttagggaagt tt 22

<210> 147
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 147

aaaaccaaaa cataaaccaa aa 22

<210> 148
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 148

gattaggagg gtttgttgag at 22

<210> 149
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
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<223> primer 
<400> 149

aatggttgat gattttggtt t 21

<210> 150
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 150

actctcttcc ctatacccct aa 22

<210> 151
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 151

tgttagtaga gttttaggga ggtt 24

<210> 152
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
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<223> primer 
<400> 152

acactaccta tccttacccc ac 22

<210> 153
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 153

tttttgtttt tatggggtgt at 22

<210> 154
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 154

ttaaatatcc cttccttaac ca 22

<210> 155
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
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<223> primer 
<400> 155

agttagaaga ggagttagga tgg 23

<210> 156
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 156

taattttcca atacccattt tc 22

<210> 157
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> primer
<400> 157

tgggtagtat ttttgttggt tt 22

<210> 158
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
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<223> primer 
<400> 158 

5 cctaaaaact ctctcatcct ca 22 

<210> 159
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> primer

<400> 159

agtggtttag gagtatttgg tta 23

<210> 160
<211> 22
<212> DNA
<213> Artificial Sequence

<220>
<223> primer

<400> 160

aactccctcc atctacaata tc 22
Claims

1. A method for the amplification of nucleic acids comprising
the following steps:

1) isolating a nucleic acid sample,

2) treating said sample in a manner that differentiates
between methylated and un-methylated cytosine bases within
said sample,

3) amplifying at least one target sequence, within said
treated nucleic acid, by means of enzymatic amplification
and a set of primer molecules, wherein said primer
molecules are characterized in that

a) each primer molecule sequence reaches a predefined
measure of complexity,

b) every combination of any two primer molecules in the set
has a melting temperature below a specified threshold
temperature,

c) every combination of two primer molecules, under
conditions allowing for one or more base mismatches per
primer, does not lead to the amplification of an unwanted
product when virtually tested using the treated and the
untreated sample nucleic acids as template,

and

4) detecting said amplified target nucleic acid.



2. A method according to claim 1 wherein said primer
molecules do not contain nucleic acid sequences
complementary or identical to nucleic acid sequences of the
target sequence which prior to treatment of step 2
contained a 5'-CG-3' site.

3. A method according to claims 1 and 2 wherein said set
is comprised of at least one but not more than 32 primer
pairs.

4. A method according to claims 1 and 2 wherein said set
is comprised of at least one but not more than 16 primer
pairs.

5. A method according to claims 1 to 4 wherein the primer
molecules reach a specified value of linguistic complexity.

6. A method according to claims 1 to 4 wherein the primer
molecules reach a specified value of Shannon entropy.
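
By way of non-limiting illustration, the two complexity measures named in the preceding claims might be computed as sketched below. This section does not fix the exact formulas or threshold values, so the definitions used here are assumptions: Shannon entropy is taken over the base composition in bits per base, and linguistic complexity is taken as the product, for k = 1 up to an assumed `max_k` of 3, of the number of distinct k-mers observed divided by the maximum number possible. The function names are hypothetical.

```python
import math
from collections import Counter


def shannon_entropy(seq: str) -> float:
    """Entropy of the base composition in bits per base (0 to 2 for DNA)."""
    seq = seq.lower()
    counts = Counter(seq)
    total = len(seq)
    return sum(-(c / total) * math.log2(c / total) for c in counts.values())


def linguistic_complexity(seq: str, max_k: int = 3) -> float:
    """Product over k of (distinct k-mers observed / maximum possible).

    Assumed definition; max_k = 3 is an arbitrary illustrative choice.
    """
    seq = seq.lower()
    score = 1.0
    for k in range(1, max_k + 1):
        windows = len(seq) - k + 1
        if windows < 1:
            break
        observed = len({seq[i:i + k] for i in range(windows)})
        score *= observed / min(4 ** k, windows)
    return score


# A primer from the sequence listing, used only as a sample string.
primer = "tgagagtgttttagggaagttt"
print(shannon_entropy(primer), linguistic_complexity(primer))
```

A primer designed against bisulfite treated DNA typically uses only three of the four bases, so its composition entropy cannot exceed log2(3), or about 1.585 bits; a threshold would have to be chosen with that ceiling in mind.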

7. A method according to claims 1 to 6 wherein the nucleic
acid sample is isolated from a bodily fluid, a cell
culture, a tissue sample or a combination thereof.

8. A method according to claims 1 to 7 wherein the nucleic
acid sample is comprised of plasmid DNA, BACs, YACs or
genomic DNA.

9. A method according to claims 1 to 7 wherein the nucleic
acid sample is comprised of human genomic DNA.

10. A method according to claims 1 to 9 wherein said sample
is treated by means of a solution of a bisulfite, hydrogen
sulfite or disulfite.
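
For illustration only, the effect of such a treatment on a template sequence can be modelled in silico as follows. The sketch assumes the usual behaviour of bisulfite treatment: unmethylated cytosine is deaminated to uracil, which is read as thymine after amplification, while methylated cytosine is left unchanged. The function names and the `methylated` parameter (0-based positions of methylated cytosines) are hypothetical, and `treated_templates` additionally assumes a fully unmethylated sample.

```python
def reverse_complement(seq: str) -> str:
    """Reverse complement of an upper- or lower-case DNA string."""
    comp = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(comp[b] for b in reversed(seq.upper()))


def bisulfite_convert(seq: str, methylated=frozenset()) -> str:
    """Replace every unmethylated C by T; methylated C positions stay C."""
    out = []
    for i, base in enumerate(seq.upper()):
        if base == "C" and i not in methylated:
            out.append("T")
        else:
            out.append(base)
    return "".join(out)


def treated_templates(seq: str) -> dict:
    """Both strands after treatment, each written 5'->3'.

    Assumes no methylation. After conversion the two strands are no
    longer complementary, so each must be handled as its own template.
    """
    return {
        "top": bisulfite_convert(seq),
        "bottom": bisulfite_convert(reverse_complement(seq)),
    }


print(bisulfite_convert("ACGTCC"))                  # all C converted
print(bisulfite_convert("ACGTCC", methylated={1}))  # C at index 1 protected
print(treated_templates("ACGTCC"))
```

Because most cytosines are replaced, the treated strands consist largely of three bases, which is the low-complexity situation the claims address.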
11. A method according to claims 1 to 10 wherein said
primer molecule comprises at least one nucleotide within
the last three nucleotides from the 3' end of the molecule
wherein said nucleotide is complementary to a nucleotide of
the target sequence that was converted to a different
nucleotide by the treatment performed in step 2) of claim 1.

12. A method according to claims 1 to 10 wherein said
primer molecule comprises at least one nucleotide within
the last three nucleotides from the 3' end of the molecule
wherein said nucleotide is complementary to a nucleotide of
the target sequence that was converted to a different
nucleotide by bisulfite treatment.
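
As a hedged illustration of the 3' end condition in the preceding claim, the check below compares a primer binding site before and after treatment. It assumes both versions of the site are given 5'->3' on the template strand, so that the 3' terminal bases of the primer pair with the first bases of the site. The function names and the `window` parameter are hypothetical.

```python
def converted_positions(untreated_site: str, treated_site: str) -> list:
    """Indices at which the treatment changed the template base."""
    if len(untreated_site) != len(treated_site):
        raise ValueError("sites must have equal length")
    pairs = zip(untreated_site.upper(), treated_site.upper())
    return [i for i, (before, after) in enumerate(pairs) if before != after]


def has_converted_base_near_3prime(untreated_site: str,
                                   treated_site: str,
                                   window: int = 3) -> bool:
    """True if a converted template base pairs with one of the last
    `window` nucleotides at the 3' end of the primer.

    The primer is antiparallel to its site, so its 3' end faces the
    5' end of the site, i.e. site indices 0 .. window - 1.
    """
    changed = converted_positions(untreated_site, treated_site)
    return any(i < window for i in changed)


# Site whose first two bases were converted: the primer 3' end covers them.
print(has_converted_base_near_3prime("CCATGA", "TTATGA"))
# Site converted only at its far end: the primer 3' end is not affected.
print(has_converted_base_near_3prime("AGATCC", "AGATTT"))
```

A primer that passes this check cannot be extended efficiently on unconverted template, which is presumably why the condition is placed at the 3' end.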

13. A method according to claims 1 to 12 wherein each of
said primer molecules is characterized in that the 
last at least 5 bases at the 3' end of said primer 
molecule are not complementary to the sequence of any 
other primer molecule in the set. 
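
The 3' end condition of the preceding claim could, for example, be screened as sketched below. This is a minimal sketch under stated assumptions: only perfect Watson-Crick pairing of the terminal bases is considered, the tail length defaults to the 5 bases mentioned above, and the function names are hypothetical.

```python
from itertools import permutations

_COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA string, returned in lower case."""
    return seq.lower().translate(_COMPLEMENT)[::-1]


def three_prime_clash(primer: str, other: str, tail: int = 5) -> bool:
    """True if the last `tail` bases of `primer` can pair perfectly
    (antiparallel) with some stretch of `other`."""
    tail_seq = primer.lower()[-tail:]
    return reverse_complement(tail_seq) in other.lower()


def clashing_pairs(primers, tail: int = 5):
    """All ordered pairs (a, b) where the 3' tail of a pairs with b."""
    return [(a, b) for a, b in permutations(primers, 2)
            if three_prime_clash(a, b, tail)]


# Two primers from the sequence listing, used only as sample strings.
print(three_prime_clash("tgagagtgttttagggaagttt", "aaaaccaaaacataaaccaaaa"))
```

Ordered pairs are used because the condition is directional: the 3' tail of one primer may pair with another primer without the reverse being true.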

14. A method according to claims 1 to 13 wherein the num-
ber of mismatches allowed for when virtually testing
the amplification of unwanted products according to 
step 3 c) of claim 1 is less than 20% of the number 
of nucleotides of the primer molecule. 

15. A method according to claims 1 to 13 wherein the num-
ber of nucleotides creating one gap, when aligning
the primer molecule sequence with the template se- 
quence, allowed for, when virtually testing the am- 
plification of unwanted products according to step 3 
c) of claim 1 is less than 20% of the number of nu- 
cleotides of the primer molecule. 
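
A percentage limit of this kind translates into a whole-number budget of mismatches (or gap nucleotides) that depends on the primer length. The sketch below reads "less than" strictly and uses integer arithmetic to avoid rounding surprises; the function name is hypothetical.

```python
def max_allowed(primer_length: int, percent: int) -> int:
    """Largest whole number strictly below percent % of primer_length."""
    if primer_length <= 0 or percent <= 0:
        return 0
    return (percent * primer_length - 1) // 100


# Primers in the sequence listing are 21 to 24 nucleotides long.
for length in (21, 22, 23, 24):
    print(length, max_allowed(length, 20))
```

For a 22-nucleotide primer, 20% of the length is 4.4, so at most four mismatches would be tolerated in the virtual test; for a 20-nucleotide primer the limit is exactly 4.0, so at most three.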
16. A method according to claims 1 to 13 wherein the num-
ber of mismatches allowed for when virtually testing
the amplification of unwanted products according to 
step 3c) of claim 1 is less than 10% of the number 
of nucleotides of the primer molecule. 

17. A method according to claims 1 to 13 wherein the num-
ber of nucleotides creating one gap, when aligning
the primer molecule sequence with the template se- 
quence, allowed for, when virtually testing the am- 
plification of unwanted products according to step 3 
c) of claim 1 is less than 10% of the number of nu- 
cleotides of the primer molecule. 

18. A method according to claims 1 to 13 wherein the num-
ber of mismatches allowed for when virtually testing
the amplification of unwanted products according to
step 3c) of claim 1 is less than 5% of the number of
nucleotides of the primer molecule.

19. A method according to claims 1 to 13 wherein the num-
ber of nucleotides creating one gap, when aligning
the primer molecule sequence with the template se- 
quence, allowed for, when virtually testing the am- 
plification of unwanted products according to step 3 
c) of claim 1 is less than 5% of the number of nu- 
cleotides of the primer molecule. 

20. A method according to claims 1 to 13 wherein the num-
ber of mismatches allowed for when virtually testing
the amplification of unwanted products according to 
step 3 c) of claim 1 is less than seven. 

21. A method according to claim 20 wherein the number of
mismatches allowed for is less than five. 
22. A method according to claim 20 wherein the number of
mismatches allowed for is less than three.

23. A method according to claim 20 wherein the number of
mismatches allowed for is one.

24. A method according to claims 1 to 13 wherein the number
of mismatches allowed for when virtually testing the
amplification of unwanted products according to step 3c)
of claim 1 is determined by a pre-specified maximum melting
temperature.

25. A method according to claims 1 to 24 wherein said
primer molecules are used to amplify nucleic acid sequences
that prior to treatment of step 2 comprised more than eight
5'-CG-3' sites.

26. A method according to claims 1 to 24 wherein said
primer molecules are used to amplify nucleic acid sequences
that prior to treatment of step 2 comprised more than six
5'-CG-3' sites.

27. A method according to claims 1 to 24 wherein said
primer molecules are used to amplify nucleic acid sequences
that prior to treatment of step 2 comprised more than four
5'-CG-3' sites.

28. A method according to claims 1 to 24 wherein said
primer molecules are used to amplify nucleic acid sequences
that prior to treatment of step 2 comprised more than two
5'-CG-3' sites.
29. A method according to claims 1 to 28 wherein the
ability of said primer molecules to amplify an unwanted
product is tested by means of electronic PCR.
30. A method according to claims 1 to 28 wherein the
ability of said primer molecules to amplify an unwanted
product is tested by means of electronic PCR, taking as
template nucleic acid the coding strand of the treated
sample, the non-coding strand of the treated sample and
both of the strands of the untreated sample.

31. A method according to claims 1 to 28 wherein the
ability of said primer molecules to amplify an un- 
wanted product is tested by means of electronic PCR, 
taking as template nucleic acid the coding strand of 
the bisulfite converted human genome, the non-coding 
strand of the bisulfite converted human genome and 
both of the strands of the untreated human genome. 
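
The virtual test referred to in the preceding claims can be pictured with the following naive search. It is a sketch under stated assumptions and not the electronic PCR program itself: only mismatches (no gaps) are tolerated, each template strand is passed separately and written 5'->3', and the function names, `max_mm` and `max_len` are hypothetical.

```python
from itertools import product

_COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq: str) -> str:
    return seq.lower().translate(_COMPLEMENT)[::-1]


def hamming_hits(pattern: str, text: str, max_mm: int) -> list:
    """Start positions where pattern aligns to text with <= max_mm mismatches."""
    hits = []
    for i in range(len(text) - len(pattern) + 1):
        mismatches = 0
        for a, b in zip(pattern, text[i:i + len(pattern)]):
            if a != b:
                mismatches += 1
                if mismatches > max_mm:
                    break
        else:  # loop finished without exceeding the mismatch budget
            hits.append(i)
    return hits


def virtual_pcr(primers, templates, max_mm=1, max_len=2000):
    """Predicted products as (template name, primer_a, primer_b, start, length).

    Every ordered combination of two primers is tried on every template:
    primer_a must match the strand itself, and the reverse complement of
    primer_b must match further downstream on the same strand.
    """
    found = []
    for name, template in templates.items():
        template = template.lower()
        for fwd, rev in product(primers, repeat=2):
            rev_site = reverse_complement(rev)
            fwd_hits = hamming_hits(fwd.lower(), template, max_mm)
            rev_hits = hamming_hits(rev_site, template, max_mm)
            for start in fwd_hits:
                for r in rev_hits:
                    length = r + len(rev_site) - start
                    if len(fwd) + len(rev) <= length <= max_len:
                        found.append((name, fwd, rev, start, length))
    return found
```

In use, the `templates` dictionary would hold the four strands named in the claim (both treated strands and both untreated strands), and any product other than the intended amplificate would count as unwanted. A search of this kind over a whole genome would need an indexed approach; the quadratic scan above is only meant to show the logic.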

32. A method according to claims 1 to 31 wherein said
primer molecules are used to amplify nucleic acids 
which are comprised of at least 50 bp but not more 
than 2000 bp. 

33. A method according to claims 1 to 31 wherein said
primer molecules are used to amplify nucleic acids 
which are comprised of at least 80 bp but not more 
than 1000 bp. 

34. A method according to claims 1 to 33 wherein said
primer molecules are comprised of 16 - 50 nucleotides.

35. A method according to claims 1 to 34 wherein said
primer molecules do not form dimers with each other. 

36. A method according to claims 1 to 35 wherein said
primer molecules do not form loops or hairpin structures.
37. A method according to claims 1 to 36 wherein said
primer molecules are complementary to target sequences
which prior to the treatment performed in step 2) of
claim 1 did not contain specified restriction enzyme
recognition sites.

38. A method according to claims 1 to 37 wherein said
primer molecules amplify regions of the treated nu- 
cleic acids which prior to the treatment performed in 
step 2) of claim 1 did not contain specified restric- 
tion enzyme recognition sites. 

39. A method for designing primers according to claim 1,
comprising the steps of

a) selecting a pool of possible primer pairs per
amplificate by means of a standard PCR primer design
program using said nucleic acids as template,

b) excluding those primer pairs which comprise a primer
that in combination with another primer molecule in the
same set exceeds a threshold melting temperature,

c) excluding those primer pairs which comprise a primer
that does not reach a specified level of complexity,

d) excluding those primer pairs which comprise a primer
that in combination with another primer molecule in the
same set, under conditions allowing for one or more base
mismatches per primer, amplifies an unwanted product when
virtually tested using the treated and the untreated
sample nucleic acid as template.
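
Steps b) to d) above can be read as successive filters over the pool produced in step a). The following sketch shows one possible arrangement and makes several assumptions that the claim does not state: pairs are accepted greedily in pool order, complexity is measured as composition entropy, the melting temperature of a primer-primer duplex is estimated with the Wallace rule (2 degrees per A/T, 4 degrees per G/C) over the longest perfectly complementary stretch, and the virtual test is supplied by the caller as the hypothetical callback `unwanted_product`.

```python
import math
from collections import Counter
from itertools import combinations

_COMPLEMENT = str.maketrans("acgt", "tgca")


def revcomp(seq: str) -> str:
    return seq.lower().translate(_COMPLEMENT)[::-1]


def entropy(seq: str) -> float:
    """Composition entropy in bits per base (assumed complexity measure)."""
    n = len(seq)
    return sum(-(c / n) * math.log2(c / n) for c in Counter(seq.lower()).values())


def duplex_tm(a: str, b: str) -> int:
    """Wallace-rule Tm of the longest stretch of a that pairs perfectly with b."""
    a, target = a.lower(), revcomp(b)
    best = ""
    for i in range(len(a)):
        for j in range(i + len(best) + 1, len(a) + 1):
            if a[i:j] in target:
                best = a[i:j]
            else:
                break
    at = best.count("a") + best.count("t")
    gc = best.count("g") + best.count("c")
    return 2 * at + 4 * gc


def filter_pairs(pool, min_entropy, max_tm, unwanted_product):
    """Apply steps b), c) and d) to a pool of (forward, reverse) pairs."""
    accepted = []
    for pair in pool:
        primers = [p for kept in accepted for p in kept] + list(pair)
        # b) no two primers in the set may form a duplex above the threshold
        if any(duplex_tm(x, y) > max_tm for x, y in combinations(primers, 2)):
            continue
        # c) every primer of the pair must reach the complexity threshold
        if any(entropy(p) < min_entropy for p in pair):
            continue
        # d) caller-supplied virtual test on the would-be primer set
        if unwanted_product(primers):
            continue
        accepted.append(pair)
    return accepted
```

The cheap checks run before the virtual test because the latter is by far the most expensive step when the template is a whole genome. The threshold values `min_entropy` and `max_tm` are left to the caller, since this section does not specify them.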
40. A method for designing said primer molecules according
to claim 1 and 43, adding the step of

e) excluding from the remaining confirmed primer pairs
those pairs which in said amplification step do not result
in the amplification of the intended product when
performing a single PCR experiment.

41. A method for designing primers according to claims 39
and 40, wherein said template nucleic acids are masked for
repeats and SNPs before designing said primer molecules and
wherein said standard PCR primer design program considers
one or more of the following factors:

length of amplificate, length of primer, melting
temperature of the primers, dimer formation parameters,
loop formation parameters, exclusion of unidentified or
ambiguous nucleotides in the primer sequence, exclusion of
restriction enzyme recognition sites.

42. A method according to claims 39 to 41 wherein said
measure of complexity is a measure of linguistic com- 
plexity. 

43. A method according to claims 39 to 41 wherein said
measure of complexity is a measure of Shannon en- 
tropy. 

44. A method according to claims 39 to 43 wherein the
following step is carried out prior to performing step d):

excluding from the remaining primer pairs those pairs which
comprise a primer molecule that comprises at least one CpG
site.

45. A method according to claims 39 to 44 wherein the
following step is carried out prior to performing step d):

excluding from the remaining primer pairs those pairs which
comprise a primer molecule that does not contain at least
one nucleotide within the last three nucleotides from the
3' end of the molecule wherein said nucleotide is
complementary to a nucleotide of the target sequence that
was converted to a different nucleotide by the treatment
performed in step 2) of claim 1.

46. A method according to claims 39 to 45 wherein the
following step is carried out prior to performing step d):

excluding from the remaining primer pairs those pairs which
comprise a primer molecule that contains more than 5 bases
at its 3' end that are complementary to the sequence of any
other primer molecule in the set.

47. A method according to claims 39 to 46 wherein the
following step is carried out prior to performing step d):

excluding from the remaining primer pairs those pairs which
amplify a nucleic acid that did not, prior to the treatment
in step 2 of claim 1, contain at least two CpG sites.
48. A method according to claims 39 to 47 wherein the
following step is added before performing step d):

excluding from the remaining primer pairs those pairs which
comprise one primer molecule that in combination with
another primer molecule in the set amplifies an unwanted
product, when virtually testing according to step 3 c) of
claim 1 under conditions allowing for a number of
mismatching nucleotides of 20% of the number of nucleotides
of the primer molecule.

49. A method according to claims 39 to 47 wherein the
following step is added before performing step d):

excluding from the remaining primer pairs those pairs which
comprise one primer molecule that in combination with
another primer molecule in the set amplifies an unwanted
product, when virtually testing according to step 3 c) of
claim 1 under conditions allowing for a number of
nucleotides creating one gap, when aligning the primer
molecule sequence with the template sequence, of up to 20%
of the number of nucleotides of the primer molecule.

50. A method according to claims 39 to 47 wherein the
following step is added before performing step d):

excluding from the remaining primer pairs those pairs which
comprise one primer molecule that in combination with
another primer molecule in the set amplifies an unwanted
product, when virtually testing according to step 3 c) of
claim 1 under conditions allowing for four or less
mismatching base pairs.
51. A method according to claims 39 to 47 wherein the
following step is added before performing step d):

excluding from the remaining primer pairs those pairs which
comprise one primer molecule that in combination with
another primer molecule in the set amplifies an unwanted
product, when virtually testing according to step 3 c) of
claim 1 under conditions allowing for two or less
mismatching base pairs.

Abstract 



The present invention relates to a method for the
amplification of nucleic acids comprising the following
steps:

1) isolating a nucleic acid sample,

2) treating said sample in a manner that differentiates
between methylated and un-methylated cytosine bases within
said sample,

3) amplifying at least one target sequence, within said
treated nucleic acid, by means of enzymatic amplification
and a set of primer molecules, wherein said primer
molecules are characterized in that

a) each primer molecule sequence reaches a predefined
measure of complexity,

b) every combination of any two primer molecules in the set
has a melting temperature below a specified threshold
temperature,

c) every combination of two primer molecules, under
conditions allowing for one or more base mismatches per
primer, does not lead to the amplification of an unwanted
product when virtually tested using the treated and the
untreated sample nucleic acids as template,

and 4) detecting said amplified target nucleic acid.

The invention describes a method for amplifying nucleic
acids, such as DNA, by means of an enzymatic amplification
step, such as a polymerase chain reaction, specified for
template nucleic acids of low complexity, e.g. pre-treated
DNA, like but not limited to DNA pre-treated with bisulfite.
The invention is based on the use of specific
oligo-nucleotide primer molecules to solely amplify specific
pieces of DNA. It is disclosed how to optimize the primer
design for a PCR if the template DNA is of low complexity.
Figure 3

Figure 4